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(57) ABSTRACT 

The invention describes a new method to sequence DNA. 
The improvements over the existing DNA sequencing tech- 
nologies are high speed, high throughput, no electrophoresis 
and gel reading artifacts due to the complete absence of an 
electrophoretic step, and no costly reagents involving vari- 
ous substitutions with stable isotopes. The invention utilizes 
the Sanger sequencing strategy and assembles the sequence 
information by analysis of the nested fragments obtained by 
base-specific chain termination via their different molecular 
masses using mass spectrometry, as for example, MALDI or 
ES mass spectrometry. A further increase in throughput can 
be obtained by introducing mass-modifications in the oli- 
gonucleotide primer, chain-terminating nucleoside triphos- 
phates and/or in the chain-elongating nucleoside 
triphosphates, as well as using integrated tag sequences 
which allow multiplexing by hybridization of tag specific 
probes with mass differentiated molecular weights. 
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DNA SEQUENCING BY MASS 
SPECTROMETRY 

RELATED APPLICATIONS 

This application is a continuation-in-part of U.S. appli- 
cation Ser. No. 08/178,216 filed Jan. 6, 1994, which issued 
as U.S. Pat. No. 5,547,835, which itself is a continuation- 
in-part of U.S. application Ser. No. 08/001,323 filed Jan. 7, 
1993, which is now abandoned. The contents of all related 
applications are incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

Since the genetic information is represented by the 
sequence of the four DNA building blocks deoxyadenosine- 
(dpA), deoxyguanosine-(dpG), deoxycytidine-(dpC) and 
deoxythymidine-5'-phosphate (dpT), DNA sequencing is 
one of the most fundamental technologies in molecular 
biology and the life sciences in general. The ease and the rate 
by which DNA sequences can be obtained greatly affects 
related technologies such as development and production of 
new therapeutic agents and new and useful varieties of 
plants and microorganisms via recombinant DNA technol- 
ogy. In particular, unraveling the DNA sequence helps in 
understanding human pathological conditions including 
genetic disorders, cancer and AIDS. In some cases, very 
subtle differences such as a one nucleotide deletion, addition 
or substitution can create serious, in some cases even fatal, 
consequences. Recently, DNA sequencing has become the 
core technology of the Human Genome Sequencing Project 
(e.g., J. E. Bishop and M. Waldholz, 1991, Genome: The 
Story of the Most Astonishing Scientific Adventure of Our 
Time — The Attempt to Map All the Genes in the Human 
Body, Simon & Schuster, New York). Knowledge of the 
complete human genome DNA sequence will certainly help 
to understand, to diagnose, to prevent and to treat human 
diseases. To be able to tackle successfully the determination 
of the approximately 3 billion base pairs of the human 
genome in a reasonable time frame and in an economical 
way, rapid, reliable, sensitive and inexpensive methods need 
to be developed, which also offer the possibility of automa- 
tion. The present invention provides such a technology. 

Recent reviews of today's methods together with future 
directions and trends are given by Barrell (The FASEB 
Journal 5, 40-45 (1991)), and Trainor {Anal. Chem. 62, 
418-26 (1990)). 

Currently, DNA sequencing is performed by either the 
chemical degradation method of Maxam and Gilbert 
(Methods in Enzymology 65, 499—560 (1980)) or the enzy- 
matic dideoxynucleotide termination method of Sanger et al. 
(Proc. Natl. Acad. Set USA 74, 5463-67 (1977)). In the 
chemical method, base specific modifications result in a base 
specific cleavage of the radioactive or fluorescently labeled 
DNA fragment. With the four separate base specific cleavage 
reactions, four sets of nested fragments are produced which 
are separated according to length by polyacrylamide gel 
electrophoresis (PAGE). After autoradiography, the 
sequence can be read directly since each band (fragment) in 
the gel originates from a base specific cleavage event. Thus, 
the fragment lengths in the four "ladders" directly translate 
into a specific position in the DNA sequence. 

In the enzymatic chain termination method, the four base 
specific sets of DNA fragments are formed by starting with 
a primer/template system elongating the primer into the 
unknown DNA sequence area and thereby copying the 
template and synthesizing a complementary strand by DNA 
polymerases, such as Klenow fragment of E. coli DNA 
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polymerase I, a DNA polymerase from Thermus aquaticus, 
Taq DNA polymerase, or a modified T7 DNA polymerase, 
Sequenase (Tabor et al., Proc. Natl. Acad. Sci. USA 84, 
4767^1771 (1987)), in the presence of chain-terminating 

5 reagents. Here, the chain-terminating event is achieved by 
incorporating into the four separate reaction mixtures in 
addition to the four normal deoxynucleoside triphosphates, 
dATP, dGTP, dTTP and dCTP, only one of the chain- 
terminating dideoxynucleoside triphosphates, ddATP, 

10 ddGTP, ddTTP or ddCTP, respectively, in a limiting small 
concentration. The four sets of resulting fragments produce, 
after electrophoresis, four base specific ladders from which 
the DNA sequence can be determined. 

A recent modification of the Sanger sequencing strategy 

15 involves the degradation of phosphorothioate-containing 
DNA fragments obtained by using alpha-thio dNTP instead 
of the normally used ddNTPs during the primer extension 
reaction mediated by DNA polymerase (Labeit et al., DNA 
5, 173-177 (1986); Amersham, PCT-Application GB86/ 

20 00349; Eckstein et al., Nucleic Acids Res. 16, 9947 (1988)). 
Here, the four sets of base-specific sequencing ladders are 
obtained by limited digestion with exonuclease III or snake 
venom phosphodiesterase, subsequent separation on PAGE 
and visualization by radioisotopic labeling of either the 

25 primer or one of the dNTPs. In a further modification, the 
base-specific cleavage is achieved by alkylating the sulphur 
atom in the modified phosphodiester bond followed by a 
heat treatment (Max-Planck-Gesellschaft, DE 3930312 Al). 
Both methods can be combined with the amplification of the 

3Q DNA via the Polymerase Chain Reaction (PCR). 

On the upfront end, the DNA to be sequenced has to be 
fragmented into sequencable pieces of currently not more 
than 500 to 1000 nucleotides. Starting from a genome, this 
is a multi-step process involving cloning and subcloning 

35 steps using different and appropriate cloning vectors such as 
YAC, cosmids, plasmids and M13 vectors (Sambrook et al., 
Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press, 1989). Finally, for Sanger 
sequencing, the fragments of about 500 to 1000 base pairs 

40 are integrated into a specific restriction site of the replicative 
form I (RF I) of a derivative of the M13 bacteriophage 
(Vieria and Messing, Gene 19, 259 (1982)) and then the 
double -stranded form is transformed to the single -stranded 
circular form to serve as a template for the Sanger sequenc- 

45 ing process having a binding site for a universal primer 
obtained by chemical DNA synthesis (Sinha, Biernat, 
McManus and Koster, Nucleic Acids Res. 12, 4539-57 
(1984); U.S. Pat. No. 4,725,677 upstream of the restriction 
site into which the unknown DNA fragment has been 

50 inserted. Under specific conditions, unknown DNA 
sequences integrated into supercoiled double -stranded plas- 
mid DNA can be sequenced directly by the Sanger method 
(Chen and Seeburg, DNA 4, 165-170 (1985)) and Lim et al., 
Gene Anal. Techn. 5, 32-39 (1988), and, with the Poly- 

55 merase Chain Reaction (PCR) (PCR Protocols: A Guide to 
Methods and Applications, Innis et al., editors, Academic 
Press, San Diego (1990)) cloning or subcloning steps could 
be omitted by directly sequencing off chromosomal DNA by 
first amplifying the DNA segment by PCR and then applying 

60 the Sanger sequencing method (Innis et al., Proc. Natl. Acad. 
Set USA 85, 9436-9440 (1988)). In this case, however, the 
DNA sequence in the interested region most be known at 
least to the extent to bind a sequencing primer. 

In order to be able to read the sequence from PAGE, 

65 detectable labels have to be used in either the primer (very 
often at the 5'-end) or in one of the deoxynucleoside 
triphosphates, dNTP. Using radioisotopes such as 32 P, 33 P, or 
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35 S is still the most frequently used technique. After PAGE, 
the gels are exposed to X-ray films and silver grain exposure 
is analyzed. The use of radioisotopic labeling creates several 
problems. Most labels useful for autoradiographic detection 
of sequencing fragements have relatively short half-fives 
which can limit the useful time of the labels. The emission 
high energy beta radiation, particularly from 32 P, can lead to 
breakdown of the products via radiolysis so that the sample 
should be used very quickly after labeling. In addition, high 
energy radiation can also cause a deterioration of band 
sharpness by scattering. Some of these problems can be 
reduced by using the less energetic isotopes such as 33 P or 
35 S (see, e.g., Ornstein et al., Biotechniques 3, 476 (1985)). 
Here, however, longer exposure times have to be tolerated. 
Above all, the use of radioisotopes poses significant health 
risks to the experimentalist and, in heavy sequencing 
projects, decontamination and handling the radioactive 
waste are other severe problems and burdens. 

In response to the above mentioned problems related to 
the use of radioactive labels, non-radioactive labeling tech- 
niques have been explored and, in recent years, integrated 
into partly automated DNA sequencing procedures. All these 
improvements utilize the Sanger sequencing strategy. The 
fluorescent label can be tagged to the primer (Smith et al., 
Nature 321, 674-679 (1986) and EPO Patent No. 
87300998.9; Du Pont De Nemours EPO Application No. 
0359225; Ansorge et al J. Biochem. Biophys. Methods 13, 
325-32 (1986)) or to the chain-termmating dideoxynuclo- 
side triphosphates (Prober et al. Science 238, 336-41 (1987); 
Applied Biosystems, PCT Application WO 91/05060). 
Based on either labeling the primer or the ddNTP, systems 
have been developed by Applied Biosystems (Smith et al., 
Science 235, G89 (1987); U.S. Pat. Nos. 570,973 and 
689,013), Du Pont De Nemours (Prober et al. Science 238, 
336-341 (1987); U.S. Pat. Nos. 881,372 and 57,566), 
Pharmacia-LKB (Ansorge el al. Nucleic Acids Res. 15, 
4593-4602 (1987) and EMBL Patent Application DE 
P3724442 and P3805808.1) and Hitachi (JP 1-90844 and DE 
4011991 Al). A somewhat similar approach was developed 
by Brumbaugh et al. iProc. Natl Set USA 85, 5610-14 
(1988) and U.S. Pat. No. 4,729,947). An improved method 
for the Du Pont system using two electrophoretic lanes with 
two different specific labels per lane is described (PCT 
Application W092/02635). A different approach uses fluo- 
rescently labeled avidin and biotin labeled primers. Here, the 
sequencing ladders ending with biotin are reacted during 
electrophoresis with the labeled avidin which results in the 
detection of the individual sequencing bands (Brumbaugh el 
al, U.S. Pat. No. 594,676). 

More recendy even more sensitive non-radioactive label- 
ing techniques for DNA using chemiluminescence trigger- 
able and amplifiable by enzymes have been developed 
(Beck, O'Keefe, Coull and Koster, Nucleic Acids Res. 17, 
5115-5123 (1989) and Beck and Koster, Anal Chem. 62, 
2258—2270 (1990)). These labeling methods were combined 
with multiplex DNA sequencing (Church el al. Science 240, 
185-188 (1988) to provide for a strategy aimed at high 
throughput DNA sequencing (Koster et al., Nucleic Acids 
Res. Symposium Ser. No. 24, 318-321 (1991), University of 
Utah, PCT Application No. WO 90/15883); this strategy still 
suffers from the disadvantage of being very laborious and 
difficult to automate. 

In an attempt to simplify DNA sequencing, solid supports 
have been introduced. In most cases published so far, the 
template strand for sequencing (with or without PCR 
amplification) is immobilized on a solid support most fre- 
quently utilizing the strong biotin-avidin/streptavidin inter- 
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action (Orion-Yhtyma Oy, U.S. Pat. No. 277,643; M. Uhlen 
et al. Nucleic Acids Res. 16, 3025-38 (1988); Cemu 
Bioteknik, PCT Application No. WO 89/09282 and Medical 
Research Council, GB, PCT Application No. WO 

5 92/03575). The primer extension products synthesized on 
the immobilized template strand are purified of enzymes, 
other sequencing reagents and by-products by a washing 
step and then released under denaturing conditions by loos- 
ing the hydrogen bonds between the Watson- Crick base 

10 pairs and subjected to PAGE separation. In a different 
approach, the primer extension products (not the template) 
from a DNA sequencing reaction are bound to a solid 
support via biotin/avidin (Du Pont De Nemours, PCT Appli- 
cation WO 91/11533). In contrast to the above mentioned 
methods, here, the interaction between biotin and avidin is 

15 overcome by employing denaturing conditions (formamide/ 
EDTA) to release the primer extension products of the 
sequencing reaction from the solid support for PAGE sepa- 
ration. As solid supports, beads, (e.g., magnetic beads 
(Dynabeads) and Sepharose beads), filters, capillaries, plas- 

20 tic dipsticks (e.g., polystyrene strips) and microtiter wells 
are being proposed. 

All methods discussed so far have one central step in 
common: polyacrylamide gel electrophoresis (PAGE). In 
many instances, this represents a major drawback and limi- 

25 tation for each of these methods. Preparing a homogeneous 
gel by polymerization, loading of the samples, the electro- 
phoresis itself, detection of the sequence pattern (e.g., by 
autoradiography), removing the gel and cleaning the glass 
plates to prepare another gel are very laborious and time- 

30 consuming procedures. Moreover, the whole process is 
error-prone, difficult to automate, and, in order to improve 
reproducibility and reliability, highly trained and skilled 
personnel are required. In the case of radioactive labeling, 
autoradiography itself can consume from hours to days. In 

35 the case of fluorescent labeling, at least the detection of the 
sequencing bands is being performed automatically when 
using the laser-scanning devices integrated into commercial 
available DNA sequencers. One problem related to the 
fluorescent labeling is the influence of the four different 

40 base-specific fluorescent tags on the mobility of the frag- 
ments during electrophoresis and a possible overlap in the 
spectral bandwidth of the four specific dyes reducing the 
discriminating power between neighboring bands, hence, 
increasing the probability of sequence ambiguities. Artifacts 

45 are also produced by base-specific interactions with the 
polyacrylamide gel matrix (Frank and Koster, Nucleic Acids 
Res. 6, 2069 (1979)) and by the formation of secondary 
structures which result in "band compressions" and hence do 
not allow one to read the sequence. This problem has, in 

50 part, been overcome by using 7-deazadeoxyguanosine triph- 
osphates (Barr et al., Biotechniques 4, 428 (1986)). 
However, the reasons for some artifacts and conspicuous 
bands are still under investigation and need further improve- 
ment of the gel electrophoretic procedure. 

55 A recent innovation in electrophoresis is capillary zone 
electrophoresis (CZE) (Jorgenson et al.,/. Chromatography 
352, 337 (1986); Gesteland et al., Nucleic Acids Res. 18, 
1415-1419 (1990)) which, compared to slab gel electro- 
phoresis (PAGE), significantly increases the resolution of 

60 the separation, reduces the time for an electrophoretic run 
and allows the analysis of very small samples. Here, 
however, other problems arise due to the miniaturization of 
the whole system such as wall effects and the necessity of 
highly sensitive on-line detection methods. Compared to 

65 PAGE, another drawback is created by the fact that CZE is 
only a "one-lane" process, whereas in PAGE samples in 
multiple lanes can be electrophoresed simultaneously. 
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Due to the severe limitations and problems related to 
having PAGE as an integral and central part in the standard 
DNA sequencing protocol, several methods have been pro- 
posed to do DNA sequencing without an electrophoretic 
step. One approach calls for hybridization or fragmentation 
sequencing (Bains, Biotechnology 10, 757—58 (1992) and 
Mirzabekov et al., FEBS Letters 256, 118-122 (1989)) 
utilizing the specific hybridization of known short oligo- 
nucleotides (e.g., octadeoxynucleotides which gives 65,536 
different sequences) to a complementary DNA sequence. 
Positive hybridization reveals a short stretch of the unknown 
sequence. Repeating this process by performing hybridiza- 
tions with all possible octadeoxynucleotides should theo- 
retically determine the sequence. In a completely different 
approach, rapid sequencing of DNA is done by unilaterally 
degrading one single, immobilized DNA fragment by an 
exonuclease in a moving flow stream and detecting the 
cleaved nucleotides by their specific fluorescent tag via laser 
excitation (Jett et al., J. Biomolecular Structure & Dynamics 
7, 301-309, (1989); United States Department of Energy, 
PCT Application No. WO 89/03432). In another system 
proposed by Hyman (Anal Biochem. 174, 423^136 (1988)), 
the pyrophosphate generated when the correct nucleotide is 
attached to the growing chain on a primer-template system 
is used to determine the DNA sequence. The enzymes used 
and the DNA are held in place by solid phases (DEAE- 
Sepharose and Sepharose) either by ionic interactions or by 
covalent attachment. In a continuous flow-through system, 
the amount of pyrophosphate is determined via biolumines- 
cence (lucif erase). A synthesis approach to DNA sequencing 
is also used by Tsien et al. (PCT Application No. WO 
91/06678). Here, the incoming dNTP's are protected at the 
3 '-end by various blocking groups such as acetyl or phos- 
phate groups and are removed before the next elongation 
step, which makes this process very slow compared to 
standard sequencing methods. The template DNA is immo- 
bilized on a polymer support. To detect incorporation, a 
fluorescent or radioactive label is additionally incorporated 
into the modified dNTP's. The same patent application also 
describes an apparatus designed to automate the process. 

Mass spectrometry, in general, provides a means of 
"weighing" individual molecules by ionizing the molecules 
in vacuo and making them "fly" by volatilization. Under the 
influence of combinations of electric and magnetic fields, 
the ions follow trajectories depending on their individual 
mass (m) and charge (z). In the range of molecules with low 
molecular weight, mass spectrometry has long been part of 
the routine physical-organic repertoire for analysis and 
characterization of organic molecules by the determination 
of the mass of the parent molecular ion. In addition, by 
arranging collisions of this parent molecular ion with other 
particles (e.g., argon atoms), the molecular ion is fragmented 
forming secondary ions by the so-called collision induced 
dissociation (CID). The fragmentation pattern/pathway very 
often allows the derivation of detailed structural informa- 
tion. Many applications of mass spectrometric methods in 
the known in the art, particularly in biosciences, and can be 
found summarized in Methods in Enzymology, Vol. 193: 
"Mass Spectrometry" (J. A McCloskey, editor), 1990, Aca- 
demic Press, New York. 

Due to the apparent analytical advantages of mass spec- 
trometry in providing high detection sensitivity, accuracy of 
mass measurements, detailed structural information by CID 
in conjunction with an MS/MS configuration and speed, as 
well as on-line data transfer to a computer, there has been 
considerable interest in the use of mass spectrometry for the 
structural analysis of nucleic acids. Recent reviews summa- 


rizing this field include K. H. Schram, "Mass Spectrometry 
of Nucleic Acid Components, Biomedical Applications of 
Mass Spectrometry" 34, 203-287 (1990); and P. F. Crain, 
"Mass Spectrometric Techniques in Nucleic Acid 

5 Research," Mass Spectrometry Reviews 9, 505-554 (1990). 
The biggest hurdle to applying mass spectrometry to nucleic 
acids is the difficulty of volatilizing these very polar biopoly- 
mers. Therefore, "sequencing" has been limited to low 
molecular weight synthetic oligonucleotides by determining 

10 the mass of the parent molecular ion and through this, 
confirming the already known sequence, or alternatively, 
confirming the known sequence through the generation of 
secondary ions (fragment ions) via CID in an MS/MS 
configuration utilizing, in particular, for the ionization and 

15 volatilization, the method of fast atomic bombardment (FAB 
mass spectrometry) or plasma desorption (PD mass 
spectrometry). As an example, the application of FAB to the 
analysis of protected dimeric blocks for chemical synthesis 
of oligodeoxynucleotides has been described (Koster et al. 

20 Biomedical Environmental Mass Spectrometry 14, 111-116 
(1987)). 

Two more recent ionization/desorption techniques are 
electrospray/ionspray (ES) and matrix-assisted laser 
desorption/ionization (MALDI). ES mass spectrometry has 

25 been introduced by Fenn et al. (J. Phys. Chem. 88, 4451—59 
(1984); PCT Application No. WO 90/14148) and current 
applications are summarized in recent review articles (R. D. 
Smith et al., Anal. Chem. 62, 882-89 (1990) and B. Ardrey, 
Electrospray Mass Spectrometry, Spectroscopy Europe, 4, 

30 10-18 (1992)). The molecular weights of the tetradecanucle- 
otide d(CATGCCATGGCATG) (SEQ ID NO:l) (Covey et 
al. "The Determination of Protein, Oligonucleotide and 
Peptide Molecular Weights by Ionspray Mass 
Spectrometry," Rapid Communications in Mass 

35 Spectrometry, 2, 249-256 (1988)), of the 21 -mer 
d(AAATTGTGCACATCCTGCAGC) (SEQ ID NO:2) and 
without giving details of that of a tRNA with 76 nucleotides 
(Methods in Enzymology, 193, "Mass Spectrometry" 
(McCloskey, editor), p. 425, 1990, Academic Press, New 

40 York) have been published. As a mass analyzer, a quadru- 
pole is most frequently used. The determination of molecu- 
lar weights in femtomole amounts of sample is very accurate 
due to the presence of multiple ion peaks which all could be 
used for the mass calculation. 

45 MALDI mass spectrometry, in contrast, can be particu- 
larly attractive when a time-of-fiight (TOF) configuration is 
used as a mass analyzer. The MALDI-TOF mass spectrom- 
etry has been introduced by Hillenkamp et al. ("Matrix 
Assisted UV-Laser Desorption/ionization: A New Approach 

50 to Mass Spectrometry of Large Biomolecules," Biological 
Mass Spectrometry (Burlingame and McCloskey, editors), 
Elsevier Science Publishers, Amsterdam, pp. 49—60, 1990.) 
Since, in most cases, no multiple molecular ion peaks are 
produced with this technique, the mass spectra, in principle, 

55 look simpler compared to ES mass spectrometry. Although 
DNA molecules up to a molecular weight of 410,000 daltons 
could be desorbed and volatilized (Williams et al., "Vola- 
tilization of High Molecular Weight DNA by Pulsed Laser 
Ablation of Frozen Aqueous Solutions," Science, 246, 

60 1585-87 (1989)), this technique has so far only been used to 
determine the molecular weights of relatively small oligo- 
nucleotides of known sequence, e.g., oligothymidylic acids 
up to 18 nucleotides (Huth-Fehre et al., "Matrix-Assisted 
Laser Desorption Mass Spectrometry of Oligodeox- 

65 ythymidylic Acids," Rapid Communications in Mass 
Spectrometry, 6, 209-13 (1992)) and a double -stranded 
DNA of 28 base pairs (Williams et al, "Trme-of- Flight Mass 
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Spectrometry of Nucleic Acids by Laser Ablation and Ion- 
ization from a Frozen Aqueous Matrix," Rapid Communi- 
cations in Mass Spectrometry, 4, 348-351 (1990)). In one 
publication (Huth-Fehre et al., 1992 , supra), it was shown 
that a mixture of all the oligothymidylic acids from n=12 to 5 
n=18 nucleotides could be resolved. 

In U.S. Pat. No. 5,064,754, RNA transcripts extended by 
DNA both of which are complementary to the DNA to be 
sequenced are prepared by incorporating NTP's, dNTP's 
and, as terminating nucleotides, ddNTP's which are substi- 10 
tuted at the 5'-position of the sugar moiety with one or a 
combination of the isotopes 12 C, 13 C, 14 C, X H, 2 H, 3 H, ie O, 
17 0 and ls O. The polynucleotides obtained are degraded to 
3' -nucleotides, cleaved at the N-glycosidic linkage and the 
isotopically labeled 5 '-functionality removed by periodate 15 
oxidation and the resulting formaldehyde species deter- 
mined by mass spectrometry. A specific combination of 
isotopes serves to discriminate base-specifically between 
internal nucleotides originating from the incorporation of 
NTP's and dNTP's and terminal nucleotides caused by 2Q 
linking ddNTP's to the end of the polynucleotide chain. A 
series of RNA/DNA fragments is produced, and in one 
embodiment, separated by electrophoresis, and, with the aid 
of the so-called matrix method of analysis, the sequence is 
deduced. 25 

In Japanese Patent No. 59-131909, an instrument is 
described which detects nucleic acid fragments separated 
either by electrophoresis, liquid chromatography or high 
speed gel filtration. Mass spectrometric detection is 
achieved by incorporating into the nucleic acids atoms 30 
which normally do not occur in DNA such as S, Br, I or Ag, 
Au, Pt, Os, Hg. The method, however, is not applied to 
sequencing of DNA using the Sanger method. In particular, 
it does not propose a base-specific correlation of such 
elements to an individual ddNTP. 35 

PCT Application No. WO 89/12694 (Brennan et al., Proc. 
SPIE-Int. Soc. Opt. Eng. 1206, (New Technol. Cytom. Mol. 
Biol), pp. 60-77 (1990); and Brennan, U.S. Pat. No. 5,003, 
059) employs the Sanger methodology for DNA sequencing 
by using a combination of either the four stable isotopes 32 S, 40 
33 S, 34 S, 36 S or 35 C1, 37 C1, 79 Br, 81 Br to specifically label the 
chain-terminating ddNTP's. The sulfur isotopes can be 
located either in the base or at the alpha-position of the 
triphosphate moiety whereas the halogen isotopes are 
located either at the base or at the 3'-position of the sugar 45 
ring. The sequencing reaction mixtures are separated by an 
electrophoretic technique such as CZE, transferred to a 
combustion unit in which the sulfur isotopes of the incor- 
porated ddNTP's are transformed at about 900° C. in an 
oxygen atmosphere. The S0 2 generated with masses of 64, 50 
65, 66 or 68 is determined on-line by mass spectrometry 
using, e.g., as mass analyzer, a quadrupole with a single 
ion-multiplier to detect the ion current. 

A similar approach is proposed in U.S. Pat. No. 5,002,868 
(Jacobson et al., Proc. SPIE-Int. Soc. Opt. Eng. 1435, (Opt. 55 
Methods Ultrasensitive Detect. Anal. Tech. Appl), 26-35 
(1991)) using Sanger sequencing with four ddNTP's spe- 
cifically substituted at the alpha-position of the triphosphate 
moiety with one of the four stable sulfur isotopes as 
described above and subsequent separation of the four sets 60 
of nested sequences by tube gel electrophoresis. The only 
difference is the use of resonance ionization spectroscopy 
(RIS) in conjunction with a magnetic sector mass analyzer 
as disclosed in U.S. Pat. No. 4,442,354 to detect the sulfur 
isotopes corresponding to the specific nucleotide 65 
terminators, and by this, allowing the assignment of the 
DNA sequence. 
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EPO Patent Applications No. 0360676 Al and 0360677 
Al also describe Sanger sequencing using stable isotope 
substitutions in the ddNTP's such as D, 13 C, 15 N, 17 0, ls O, 
32 S, 33 S, 34 S, 36 S, 19 F, 35 C1, 37 C1, 79 Br, 81 Br and 127 I or 
functional groups such as CF 3 or Si(CH 3 ) 3 at the base, the 
sugar or the alpha position of the triphosphate moiety 
according to chemical functionality. The Sanger sequencing 
reaction mixtures are separated by tube gel electrophoresis. 
The effluent is converted into an aerosol by the electrospray/ 
thermospray nebulizer method and then atomized and ion- 
ized by a hot plasma (7000 to 8000° K.) and analyzed by a 
simple mass analyzer. An instrument is proposed which 
enables one to automate the analysis of the Sanger sequenc- 
ing reaction mixture consisting of tube electrophoresis, a 
nebulizer and a mass analyzer. 

The application of mass spectrometry to perform DNA 
sequencing by the hybridization/fragment method (see 
above) has been recently suggested (Bains, "DNA Sequenc- 
ing by Mass Spectrometry: Outline of a Potential Future 
Application," Chimicaoggi 9, 13-16 (1991)). 

SUMMARY OF THE INVENTION 
The invention describes a new method to sequence DNA. 
The improvements over the existing DNA sequencing tech- 
nologies include high speed, high throughput, no required 
electrophoresis (and, thus, no gel reading artifacts due to the 
complete absence of an electrophoretic step), and no costly 
reagents involving various substitutions with stable isotopes. 
The invention utilizes the Sanger sequencing strategy and 
assembles the sequence information by analysis of the 
nested fragments obtained by base-specific chain termina- 
tion via their different molecular masses using mass 
spectrometry, for example, MALDI or ES mass spectrom- 
etry. A further increase in throughput can be obtained by 
introducing mass modifications in the oligonucleotide 
primer, the chain-terminating nucleoside triphosphates and/ 
or the chain-elongating nucleoside triphosphates, as well as 
using integrated tag sequences which allow multiplexing by 
hybridization of tag specific probes with mass differentiated 
molecular weights. 

BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 is a representation of a process to generate the 
samples to be analyzed by mass spectrometry. This process 
entails insertion of a DNA fragment of unknown sequence 
into a cloning vector such as derivatives of M13, pUC or 
phagemids; transforming the double-stranded form into the 
single-stranded form; performing the four Sanger sequenc- 
ing reactions; linking the base -specifically terminated nested 
fragment family temporarily to a solid support; removing by 
a washing step all by-products; conditioning the nested DNA 
or RNA fragments by, for example, cation-ion exchange or 
modification reagent and presenting the immobilized nested 
fragments either directly to mass spectrometric analysis or 
cleaving the purified fragment family off the support and 
evaporating the cleavage reagent. 

FIG. 2A shows the Sanger sequencing products using 
ddTTP as terminating deoxy nucleoside triphosphate of a 
hypothetical DNA fragment of 50 nucleotides (SEQ ID 
NO:3) in length with approximately equally balanced base 
composition. The molecular masses of the various chain 
terminated fragments are given. 

FIG. 2B shows an idealized mass spectrum of such a DNA 
fragment mixture. 

FIGS. 3A and 3B show, in analogy to FIGS. 2A and 2B, 
data for the same model sequence (SEQ ID NO:3) with 
ddATP as chain terminator. 
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FIGS. 4Aand 4B show data, analogous to FIGS. 2A and 
2B when ddGTP is used as a chain terminator for the same 
model sequence (SEQ ID NO:3). 

FIGS. 5A and 5B illustrate the results obtained where 
chain termination is performed with ddCTP as a chain 
terminator, in a similar way as shown in FIGS. 2A and 2B 
for the same model sequence (SEQ ID NO: 3). 

FIG. 6 summarizes the results of FIGS. 2Ato 5B, showing 
the correlation of molecular weights of the nested four 
fragment families to the DNA sequence (SEQ ID NO: 3). 

FIGS. 7A and 7B illustrates the general structure of 
mass-modified sequencing nucleic acid primers or tag 
sequencing probes for either Sanger DNA or Sanger RNA 
sequencing. 

FIGS. 8A and 8B shows the general structure for the 
mass-modified triphosphates for either Sanger DNA or 
Sanger RNA sequencing. General formulas of the chain- 
elongating and the chain-terminating nucleoside triphos- 
phates are demonstrated. 

FIG. 9 outlines various linking chemistries (X) with either 
polyethylene glycol or terminally monoalkylated polyethyl- 
ene glycol (R) as an example. 

FIG. 10 illustrates similar linking chemistries as shown in 
FIG. 8 and depicts various mass modifying moieties (R). 

FIG. 11 outlines how multiplex mass spectrometric 
sequencing can work using the mass-modified nucleic acid 
primer (UP). 

FIG. 12 shows the process of multiplex mass spectromet- 
ric sequencing employing mass-modified chain-elongating 
and/or terminating nucleoside triphosphates. 

FIG. 13 shows multiplex mass spectrometric sequencing 
by involving the hybridization of mass-modified tag 
sequence specific probes. 

FIG. 14 shows a MALDI-TOF spectrum of a mixture of 
ohgothymidylic acids, d(pT) 12 _ ia . 

FIG. 15 shows a superposition of MALDI-TOF spectra of 
the 50-mer d(TAACGGTCATTACGGCCATTGA- 
CTGTAGGACCTGCATTACATGACTAGCT) (SEQ ID 
NO:3) (500 fmol) and dT(pdT) 99 (500 fmol). 

FIGS. 16A-M shows the MALDI-TOF spectra of all 13 
DNA sequences representing the nested dT-terminated frag- 
ments of the Sanger DNA sequencing simulation of FIG. 2, 
500 fmol each. 

FIGS. 17Aand 17B shows the superposition of the spectra 
of FIG. 16. The two panels show two different scales and the 
spectra analyzed at that scale. 

FIG. 18 shows the superimposed MALDI-TOF spectra 
from MALDI-MS analysis of mass-modified oligonucle- 
otides as described in Example 21. 

FIG. 19 illustrates various linking chemistries between 
the solid support (P) and the nucleic acid primer (NA) 
through a strong electrostatic interaction. 

FIGS. 20Aand 20B illustrates various linking chemistries 
between the solid support (P) and the nucleic acid primer 
(NA) through a charge transfer complex of a charge transfer 
acceptor (A) and a charge transfer donor (D). 

FIG. 21 illustrates various linking chemistries between 
the solid support (P) and the nucleic acid primer (NA) 
through a stable organic radical. 

FIG. 22 illustrates a possible linking chemistry between 
the solid support (P) and the nucleic acid primer (NA) 
through Watson-Crick base pairing. 

FIG. 23 illustrates linking the solid support (P) and the 
nucleic acid primer (NA) through a photolytically cleavable 
bond. 
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FIG. 24 shows the portion of the sequence of pRFcl DNA, 
which was used as template for PCR amplification of 
unmodified and 7-deazapurine containing 99-mer and 200- 
mer nucleic acids as well as the sequences of the 19-primers 

5 and the two 18-mer reverse primers. 

FIG. 25 shows the portion of the nucleotide sequence of 
M13mpl8 RFI DNA, which was used for PCR amplification 
of unmodified and 7-deazapurine containing 103-mer 
nucleic acids. Also shown are nucleotide sequences of the 

10 17-mer primers used in the PCR. 

FIG. 26 shows the result of a polyacrylamide gel elec- 
trophoresis of PCR products purified and concentrated for 
MALDI-TOF MS analysis. M: chain length marker, lane 1: 
7-deazapurine containing 99-mer PCR product, lane 2: 

15 unmodified 99-mer, lane 3: 7-deazapurine containing 103- 
mer and lane 4: unmodified 103-mer PCR product. 

FIG. 27: an autoradiogram of polyacrylamide gel elec- 
trophoresis of PCR reactions carried out with 5'-[ 32 P]- 
labeled primers 1 and 4. Lanes 1 and 2: unmodified and 

20 7-deazapurine modified 103-mer PCR product (53321 and 
23520 counts), lanes 3 and 4: unmodified and 7-deazapurine 
modified 200-mer (71123 and 39582 counts) and lanes 5 and 
6: unmodified and 7-deazapurine modified 99-mer (173216 
and 94400 counts). 

25 FIG. 28: a) MALDI-TOF mass spectrum of the unmodi- 
fied 103-mer PCR products (sum of twelve single shot 
spectra). The mean value of the masses calculated for the 
two single strands (31768 u and 31759 u) is 31763 u. Mass 

3Q resolution: 18. b) MALDI-TOF mass spectrum of 
7-deazapurine containing 103-mer PCR product (sum of 
three single shot spectra). The mean value of the masses 
calculated for the two single strands (31727 u and 31719u) 
is 31723 u. Mass resolution: 67. 

35 FIG. 29: a) MALDI-TOF mass spectrum of the unmodi- 
fied 99-mer PCR product (sum of twenty single shot 
spectra). Values of the masses calculated for the two single 
strands: 30261 u and 30794 u. b) MALDI-TOF mass spec- 
trum of the 7-deazapurine containing 99-mer PCR product 

40 (sum of twelve single shot spectra). Values of the masses 
calculated for the two single strands: 30224 u and 30750 u. 

FIG. 30: a) MALDI-TOF mass spectrum of the unmodi- 
fied 200-mer PCR product (sum of 30 single shot spectra). 
The mean value of the masses calculated for the two single 

45 strands (61873 u and 61595 u) is 61734 u. Mass resolution: 
28. b) MALDI-TOF mass spectrum of 7-deazapurine con- 
taining 200-mer PCR product (sum of 30 single shot 
spectra). The mean value of the masses calculated for the 
two single strands (61772 u and 61514 u) is 61643 u. Mass 

50 resolution: 39. 

FIG. 31: a) MALDI-TOF mass spectrum of 7-deazapurine 
containing 100-mer PCR product with ribo modified primers. 
The mean value of the masses calculated for the two single 
strands (30529 u and 31095 u) is 30812 u. b) MALDI-TOF 

55 mass spectrum of the PCR-product after hydrolytic primer- 
cleavage. The mean value of the masses calculated for the 
two single strands (25104 u and 25229 u) is 25167 u. The 
mean value of the cleaved primers (5437 u and 5918 u) is 
5677 u. 

60 FIGS. 32 A— D shows the MALDI-TOF mass spectrum of 
the four sequencing ladders obtained from a 39-mer tem- 
plate (SEQ. ID. No. 13), which was immobilized to strepta- 
vidin beads via a 3' biotinylation. A 14-mer primer (SEQ. ID. 
NO. 14) was used in the sequencing. 

65 FIG. 33 shows a MALDI-TOF mass spectrum of a solid 
state sequencing of a 78-mer template (SEQ. ID. No. 15), 
which was immobilized to streptavidin beads via a 3' bioti- 
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nylation. A 18-mer primer (SEQ ID No. 16) and ddGTP 
were used in the sequencing. 

FIG. 34 shows a scheme in which duplex DNA probes 
with single-stranded overhang capture specific DNA tem- 
plates and also serve as primers for solid state sequencing. 5 

FIGS. 35A-D shows MALDI-TOF mass spectra obtained 
from a 5' fluorescent labeled 23-mer (SEQ. ID. No. 19) 
annealed to an 3' biotinylated 18-mer (SEQ. ID. No. 20), 
leaving a 5-base overhang, which captured a 15-mer tem- 
plate (SEQ. ID. No. 21). 10 

FIGS. 36A-36D shows a stacking flurogram of the same 
products obtained from the reaction described in FIG. 35, 
but run on a conventional DNA sequencer. 

DETAILED DESCRIPTION OF THE 15 
INVENTION 

This invention describes an improved method of sequenc- 
ing DNA. In particular, this invention employs mass spec- 
trometry to analyze the Sanger sequencing reaction mix- 2Q 
tures. 

In Sanger sequencing, four families of chain-terminated 
fragments are obtained. The mass difference per nucleotide 
addition is 289.19 for dpC, 313.21 for dpA, 329.21 for dpG 
and 304.2 for dpT, respectively. 25 

In one embodiment, through the separate determination of 
the molecular weights of the four base -specifically termi- 
nated fragment families, the DNA sequence can be assigned 
via superposition (e.g., interpolation) of the molecular 
weight peaks of the four individual experiments. In another 30 
embodiment, the molecular weights of the four specifically 
terminated fragment families can be determined simulta- 
neously by MS, either by mixing the products of all four 
reactions run in at least two separate reaction vessels (i.e., all 
run separately, or two together, or three together) or by 35 
running one reaction having all four chain-terminating 
nucleotides (e.g., a reaction mixture comprising dTTP, 
ddTTP, dATP, ddATP, dCTP, ddCTP, dGTP, ddGTP) in one 
reaction vessel. By simultaneously analyzing all four base- 
specifically terminated reaction products, the molecular 40 
weight values have been, in effect, interpolated. Comparison 
of the mass difference measured between fragments with the 
known masses of each chain -terminating nucleotide allows 
the assignment of sequence to be carried out. In some 
instances, it may be desirable to mass modify, as discussed 45 
below, the chain-terminating nucleotides so as to expand the 
difference in molecular weight between each nucleotide. It 
will be apparent to those skilled in the art when mass- 
modification of the chain-terminating nucleotides is desir- 
able and can depend, for instance, on the resolving ability of 50 
the particular spectrometer employed. By way of example, 
it may be desirable to produce four chain-terminating 
nucleotides, ddTTP, ddCTP 1 , ddATP 2 and ddGTP 3 where 
ddCTP 1 , ddATP 2 and ddGTP 3 have each been mass- 
modified so as to have molecular weights resolvable from 55 
one another by the particular spectrometer being used. 

The terms chain-elongating nucleotides and chain- 
terminating nucleotides are well known in the art. For DNA, 
chain-elongating nucleotides include 
2'-deoxyribonucleotides and chain-terminating nucleotides 60 
include 2 T ,3'-dideoxyribonucleotides. For RNA, chain- 
elongating nucleotides include ribonucleotides and chain- 
terminating nucleotides include 3'-deoxyribonucleotides. 
The term nucleotide is also well known in the art. For the 
purposes of this invention, nucleotides include nucleoside 65 
mono-, di-, and triphosphates. Nucleotides also include 
modified nucleotides such as phosphorothioate nucleotides. 
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Since mass spectrometry is a serial method, in contrast to 
currently used slab gel electrophoresis which allows several 
samples to be processed in parallel, in another embodiment 
of this invention, a further improvement can be achieved by 
multiplex mass spectrometric DNA sequencing to allow 
simultaneous sequencing of more than one DNA or RNA 
fragment. As described in more detail below, the range of 
about 300 mass units between one nucleotide addition can 
be utilized by employing either mass-modified nucleic acid 
sequencing primers or chain-elongating and/or terminating 
nucleoside triphosphates so as to shift the molecular weight 
of the base-specifically terminated fragments of a particular 
DNA or RNA species being sequenced in a predetermined 
manner. For the first time, several sequencing reactions can 
be mass spectrometrically analyzed in parallel. In yet 
another embodiment of this invention, multiplex mass spec- 
trometric DNA sequencing can be performed by mass modi- 
fying the fragment families through specific oligonucle- 
otides (tag probes) which hybridize to specific tag sequences 
within each of the fragment families. In another 
embodiment, the tag probe can be covalently attached to the 
individual and specific tag sequence prior to mass spectrom- 
etry. 

Preferred mass spectrometer formats for use in the inven- 
tion are matrix assisted laser desorption ionization 
(MALDI), electrospray (ES), ion cyclotron resonance (ICR) 
and Fourier Transform. For ES, the samples, dissolved in 
water or in a volatile buffer, are injected either continuously 
or discontinuously into an atmospheric pressure ionization 
interface (API) and then mass analyzed by a quadrupole. 
The generation of multiple ion peaks which can be obtained 
using ES mass spectrometry can increase the accuracy of the 
mass determination. Even more detailed information on the 
specific structure can be obtained using an MS/MS quadru- 
pole configuration 

In MALDI mass spectrometry, various mass analyzers 
can be used, e.g., magnetic sector/magnetic deflection 
instruments in single or triple quadrupole mode (MS/MS), 
Fourier transform and time-of- flight (TOF) configurations as 
is known in the art of mass spectrometry. For the desorption/ 
ionization process, numerous matrix/laser combinations can 
be used. Ion- trap and reflectron configurations can also be 
employed. 

In one embodiment of the invention, the molecular weight 
values of at least two base-specifically terminated fragments 
are determined concurrently using mass spectrometry. The 
molecular weight values of preferably at least five and more 
preferably at least ten base-specifically terminated frag- 
ments are determined by mass spectrometry. Also included 
in the invention are determinations of the molecular weight 
values of at least 20 base-specifically terminated fragments 
and at least 30 base-specifically terminated fragments. 
Further, the nested base-specifically terminated fragments in 
a specific set can be purified of all reactants and by-products 
but are not separated from one another. The entire set of 
nested base-specifically terminated fragments is analyzed 
concurrently and the molecular weight values are deter- 
mined. At least two base-specifically terminated fragments 
are analyzed concurrently by mass spectrometry when the 
fragments are contained in the same sample. 

In general, the overall mass spectrometric DNA sequenc- 
ing process will start with a library of small genomic 
fragments obtained after first randomly or specifically cut- 
ting the genomic DNA into large pieces which then, in 
several subcloning steps, are reduced in size and inserted 
into vectors like derivatives of M13 or pUC (e.g., M13mp 18 
or M13mpl9) (see FIG. 1). In a different approach, the 
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fragments inserted in vectors, such as Ml 3, are obtained via 
subcloning starting with a cDNA library. In yet another 
approach, the DNA fragments to be sequenced are generated 
by the polymerase chain reaction (e.g., Higuchi et al., "A 
General Method of in vitro Preparation and Mutagenesis of 5 
DNA Fragments: Study of Protein and DNA Interactions," 
Nucleic Acids Res., 16, 7351-67 (1988)). As is known in the 
art, Sanger sequencing can start from one nucleic acid 
primer (UP) binding to the plus-strand or from another 
nucleic acid primer binding to the opposite minus-strand, 
Thus, either the complementary sequence of both strands of 
a given unknown DNA sequence can be obtained (providing 
for reduction of ambiguity in the sequence determination) or 
the length of the sequence information obtainable from one 
clone can be extended by generating sequence information 15 
from both ends of the unknown vector-inserted DNA frag- 
ment. 

The nucleic acid primer carries, preferentially at the 
5 '-end, a linking functionality, L, which can include a spacer 
of sufficient length and which can interact with a suitable 20 
functionality, L, on a solid support to form a reversible 
linkage such as a photocleavable bond. Since each of the 
four Sanger sequencing families starts with a nucleic acid 
primer (L-UP; FIG. 1) this fragment family can be bound to 
the solid support by reacting with functional groups, L T , on 25 
the surface of a solid support and then intensively washed to 
remove all buffer salts, triphosphates, enzymes, reaction 
by-products, etc. Furthermore, for mass spectrometric 
analysis, it can be of importance at this stage to exchange the 
cation at the phosphate backbone of the DNA fragments in 30 
order to eliminate peak broadening due to a heterogeneity in 
the cations bound per nucleotide unit. Since the L-L' linkage 
is only of a temporary nature with the purpose to capture the 
nested Sanger DNA or RNA fragments to properly condition 
them for mass spectrometric analysis, there are different 35 
chemistries which can serve this purpose. In addition to the 
examples given in which the nested fragments are coupled 
covalently to the solid support, washed, and cleaved off the 
support for mass spectrometric analysis, the temporary link- 
age can be such that it is cleaved under the conditions of 40 
mass spectrometry, i.e., a photocleavable bond such as a 
charge transfer complex or a stable organic radical. 
Furthermore, the linkage can be formed with L being a 
quaternary ammonium group (some examples are given in 
FIG. 19). In this case, preferably, the surface of the solid 45 
support carries negative charges which repel the negatively 
charged nucleic acid backbone and thus facilitates desorp- 
tion. Desorption will take place either by the heat created by 
the laser pulse and/or, depending on L,' by specific absorp- 
tion of laser energy which is in resonance with the L' 50 
chromophore (see, e.g., examples given in FIG. 19). The 
functionalities, L and L, 1 can also form a charge transfer 
complex and thereby form the temporary L-L' linkage. 
Various examples for appropriate functionalities with either 
acceptor or donator properties are depicted without limita- 55 
tion in FIG. 20. Since in many cases the "charge-transfer 
band" can be determined by UV/vis spectrometry (see e.g. 
Organic Charge Transfer Complexes by R. Foster, Aca- 
demic Press, 1969), the laser energy can be tuned to the 
corresponding energy of the charge-transfer wavelength go 
and, thus, a specific desorption off the solid support can be 
initiated. Those skilled in the art will recognize that several 
combinations can serve this purpose and that the donor 
functionality can be either on the solid support or coupled to 
the nested Sanger DNA/RNA fragments or vice versa. 55 

In yet another approach, the temporary linkage L-L can 
be generated by homolytically forming relatively stable 
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radicals as exemplified in FIG. 21. In example 4 of FIG. 21, 
a combination of the approaches using charge-transfer com- 
plexes and stable organic radicals is shown. Here, the nested 
Sanger DNA/RNA fragments are captured via the formation 
of a charge transfer complex. Under the influence of the laser 
pulse, desorption (as discussed above) as well as ionization 
will take place at the radical position. In the other examples 
of FIG. 21 under the influence of the laser pulse, the L-L" 
linkage will be cleaved and the nested Sanger DNA/RNA 
fragments desorbed and subsequently ionized at the radical 
position formed. Those skilled in the art will recognize that 
other organic radicals can be selected and that, in relation to 
the dissociation energies needed to homolytically cleave the 
bond between them, a corresponding laser wavelength can 
be selected (see e.g. Reactive Molecules by C. Wentrup, 
John Wiley & Sons, 1984). In yet another approach, the 
nested Sanger DNA/RNA fragments are captured via 
Watson-Crick base pairing to a solid support-bound oligo- 
nucleotide complementary to either the sequence of the 
nucleic acid primer or the tag oligonucleotide sequence (see 
FIG. 22). The duplex formed will be cleaved under the 
influence of the laser pulse and desorption can be initiated. 
The solid support-bound base sequence can be presented 
through natural oligoribo- or oligodeoxyribonucleotide as 
well as analogs (e.g. thio-modified phosphodiester or phos- 
photriester backbone) or employing oligonucleotide mimet- 
ics such as PNA analogs (see e.g. Nielsen et al., Science, 
254, 1497 (1991)) which render the base sequence less 
susceptible to enzymatic degradation and hence increases 
overall stability of the solid support-bound capture base 
sequence. With appropriate bonds, L-L', a cleavage can be 
obtained directly with a laser tuned to the energy necessary 
for bond cleavage. Thus, the immobilized nested Sanger 
fragments can be directly ablated during mass spectrometric 
analysis. 

Prior to mass spectrometric analysis, it may be useful to 
"condition" nucleic acid molecules, for example to decrease 
the laser energy required for volatization and/or to minimize 
fragmentation. One example of conditioning is modification 
of the phosphodiester backbone of the nucleic acid molecule 
(e.g. cation exchange), which can be useful for eliminating 
peak broadening due to a heterogeneity in the cations bound 
per nucleotide unit. In addition, a nucleic acid molecule can 
be contacted with an alkylating agent such as alkyliodide, 
iodoacetamide, p-iodoethanol, or 2,3-epoxy-l-propanol, the 
monothio phosphodiester bonds of a nucleic acid molecule 
can be transformed into a phosphotriester bond. Likewise, 
phosphodiester bonds may be transformed to uncharged 
derivatives employing trialkylsilyl chlorides. Further condi- 
tioning involves incorporating nucleotides which reduce 
sensitivity for depurination (fragmentation during MS) such 
as N7- or N9-deazapurine nucleotides, or RNA building 
blocks or using oligonucleotide triesters or incorporating 
phosphorothioate functions which are alkylated or employ- 
ing oligonucleotide mimetics such as PNA. 

Modification of the phosphodiester backbone can be 
accomplished by, for example, using alpha-thio modified 
nucleotides for chain elongation and termination. With alky- 
lating agents such as akyliodides, iodoacetamide, 
P-iodoethanol, 2,3-epoxy-l-propanol (see FIG. 10), the 
monothio phosphodiester bonds of the nested Sanger frag- 
ments are transformed into phosphotriester bonds. Multi- 
plexing by mass modification in this case is obtained by 
mass-modifying the nucleic acid primer (UP) or the nucleo- 
side triphosphates at the sugar or the base moiety. To those 
skilled in the art, other modifications of the nested Sanger 
fragments can be envisioned. In one embodiment of the 
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invention, the linking chemistry allows one to cleave off the 
so-purified nested DNA enzymatically, chemically or physi- 
cally. By way of example, the L-L T chemistry can be of a 
type of disulfide bond (chemically cleavable, for example, 
by mercaptoethanol or dithioerythrol), a biotin/streptavidin 5 
system, a heterobi functional derivative of a trityl ether 
group (Koster et aL, "A Versatile Acid -Labile Linker for 
Modification of Synthetic Biomolecules," Tetrahedron Let- 
ters 31, 7095 (1990)) which can be cleaved under mildly 
acidic conditions, a levulinyl group cleavable under almost 10 
neutral conditions with a hydrazimum/acetate buffer, an 
arginine-arginine or lysine -lysine bond cleavable by an 
endopeptidase enzyme like trypsin or a pyrophosphate bond 
cleavable by a pyrophosphatase, a photocleavable bond 
which can be, for example, physically cleaved and the like 15 
(see, e.g., FIG. 23). Optionally, another cation exchange can 
be performed prior to mass spectrometric analysis. In the 
instance that an enzyme-cleavable bond is utilized to immo- 
bilize the nested fragments, the enzyme used to cleave the 
bond can serve as an internal mass standard during MS 20 
analysis. 

The purification process and/or ion exchange process can 
be carried out by a number of other methods instead of, or 
in conjunction with, immobilization on a solid support. For 
example, the base -specifically terminated products can be 25 
separated from the reactants by dialysis, filtration (including 
ultrafiltration), and chromatography. Likewise, these tech- 
niques can be used to exchange the cation of the phosphate 
backbone with a counter-ion which reduces peak broaden- 
ing. 30 

The base-specifically terminated fragment families can be 
generated by standard Sanger sequencing using the Large 
Klenow fragment of E. coli DNA polymerase I, by 
Sequenase, Taq DNA polymerase and other DNA poly- 
merases suitable for this purpose, thus generating nested 35 
DNA fragments for the mass spectrometric analysis. It is, 
however, part of this invention that base -specifically termi- 
nated RNA transcripts of the DNA fragments to be 
sequenced can also be utilized for mass spectrometric 
sequence determination. In this case, various RNA poly- 40 
merases such as the SP6 or the T7 RNA polymerase can be 
used on appropriate vectors containing, for example, the SP6 
or the T7 promoters (e.g. Axelrod et al, "Transcription from 
Bacteriophage T7 and SP6 RNA Polymerase Promoters in 
the Presence of 3'-Deoxyribonucleoside 5' -triphosphate 45 
Chain Terminators," Biochemistry 24, 5716-23 (1985)). In 
this case, the unknown DNA sequence fragments are 
inserted downstream from such promoters. Transcription 
can also be initiated by a nucleic acid primer (Pitulle et al., 
"Initiator Oligonucleotides for the Combination of Chemical 50 
and Enzymatic RNA Synthesis," Gene 112, 101-105 
(1992)) which carries, as one embodiment of this invention, 
appropriate linking functionalities, L, which allow the 
immobilization of the nested RNA fragments, as outlined 
above, prior to mass spectrometric analysis for purification 55 
and/or appropriate modification and/or conditioning. 

For this immobilization process of the DNA/RNA 
sequencing products for mass spectrometric analysis, vari- 
ous solid supports can be used, e.g., beads (silica gel, 
controlled pore glass, magnetic beads, Sephadex/Sepharose 60 
beads, cellulose beads, etc.), capillaries, glass fiber filters, 
glass surfaces, metal surfaces or plastic material. Examples 
of useful plastic materials include membranes in filter or 
microtiter plate formats, the latter allowing the automation 
of the purification process by employing microtiter plates 65 
which, as one embodiment of the invention, carry a perme- 
able membrane in the bottom of the well functionalized with 


L'. Membranes can be based on polyethylene, 
polypropylene, poly amide, polyvinylidenedifLuoride and the 
like. Examples of suitable metal surfaces include steel, gold, 
silver, aluminum, and copper. After purification, cation 
exchange, and/or modification of the phosphodiester back- 
bone of the L-L' bound nested Sanger fragments, they can be 
cleaved off the solid support chemically, enzymatically or 
physically. Also, the L-L' bound fragments can be cleaved 
from the support when they are subjected to mass spectro- 
metric analysis by using appropriately chosen L-L' linkages 
and corresponding laser energies/intensities as described 
above and in FIGS. 19-23. 

The highly purified, four base-specifically terminated 
DNA or RNA fragment families are then analyzed with 
regard to their fragment lengths via determination of their 
respective molecular weights by MALDI or ES mass spec- 
trometry. 

For ES, the samples, dissolved in water or in a volatile 
buffer, are injected either continuously or discontinuously 
into an atmospheric pressure ionization interface (API) and 
then mass analyzed by a quadrupole. With the aid of a 
computer program, the molecular weight peaks are searched 
for the known molecular weight of the nucleic acid primer 
(UP) and determined which of the four chain-terminating 
nucleotides has been added to the UP. This represents the 
first nucleotide of the unknown sequence. Then, the second, 
the third, the XI th extension product can be identified in a 
similar manner and, by this, the nucleotide sequence is 
assigned. The generation of multiple ion peaks which can be 
obtained using ES mass spectrometry can increase the 
accuracy of the mass determination. 

In MALDI mass spectrometry, various mass analyzers 
can be used, e.g., magnetic sector/magnetic deflection 
instruments in single or triple quadrupole mode (MS/MS), 
Fourier transform and time-of- flight (TOF) configurations as 
is known in the art of mass spectrometry. FIGS. 2A through 
6 are given as an example of the data obtainable when 
sequencing a hypothetical DNA fragment of 50 nucleotides 
in length (SEQ ID NO:3) and having a molecular weight of 
15,344.02 daltons. The molecular weights calculated for the 
ddT (FIGS. 2A and 2B), ddA (FIGS. 3A and 3B), ddG 
(FIGS. 4A and 4B) and ddC (FIGURES 5 A and 5B) termi- 
nated products are given (corresponding to fragments of 
SEQ ID NO: 3) and the idealized four MALDI-TOF mass 
spectra shown. All four spectra are superimposed, and from 
this, the DNA sequence can be generated. This is shown in 
the summarizing FIG. 6, demonstrating how the molecular 
weights are correlated with the DNA sequence. MALDI- 
TOF spectra have been generated for the ddT terminated 
products (FIG. 16) corresponding to those shown in FIG. 2 
and these spectra have been superimposed (FIG. 17). The 
correlation of calculated molecular weights of the ddT 
fragments and their experimentally-verified weights are 
shown in Table 1. Likewise, if all four chain-terminating 
reactions are combined and then analyzed by mass 
spectrometry, the molecular weight difference between two 
adjacent peaks can be used to determine the sequence. For 
the desorption/ionization process, numerous matrix/laser 
combinations can be used. 


7-mer 

10- mer 

11- mer 
19-mer 


2104.45 
3011.04 
3315.24 
5771.82 


2119.9 
3026.1 
3330.1 
5788.0 


+15.4 
+15.1 
+14.9 
+16.2 
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20-mer 
24-mer 
26-mer 
33-mer 

37- mer 

38- mer 
42-mer 
46-mer 
50-mer 


6076.02 
7311.82 
7945.22 
10112.63 
11348.43 
11652.62 
12872.42 
14108.22 
15344.02 


6093.8 
7374.9 
7960.9 
10125.3 
11361.4 
11670.2 
128SS.3 
14125.0 
15362.6 


+17.8 
+63.1 
+15.7 
+12.7 
+13.0 
+17.6 
+15.9 
+16.8 
+18.6 


In order to increase throughput to a level necessary for 
high volume genomic and cDNA sequencing projects, a 
further embodiment of the present invention is to utilize ^ 
multiplex mass spectrometry to simultaneously determine 
more than one sequence. This can be achieved by several, 
albeit different, methodologies, the basic principle being the 
mass modification of the nucleic acid primer (UP), the 
chain-elongating and/or terminating nucleoside ^ 0 
triphosphates, or by using mass- differentiated tag probes 
hybridizable to specific tag sequences. The term "nucleic 
acid primer" as used herein encompasses primers for both 
DNA and RNA Sanger sequencing. 

By way of example, FIG. 7 presents a general formula of 25 
the nucleic acid primer (UP) and the tag probes (TP). The 
mass modifying moiety can be attached, for instance, to 
either the 5'-end of the oligonucleotide (M 1 ), to the nucleo- 
base (or bases) (M 2 , M 7 ), to the phosphate backbone (M 3 ), 
and to the 2'-position of the nucleoside (nucleosides) (M 4 , 30 
M e ) or/and to the terminal 3 , -position (M 5 ). Primer length 
can vary between 1 and 50 nucleotides in length. For the 
priming of DNA Sanger sequencing, the primer is prefer- 
entially in the range of about 15 to 30 nucleotides in length. 
For artificially priming the transcription in a RNA 35 
polymerase-mediated Sanger sequencing reaction, the 
length of the primer is preferentially in the range of about 2 
to 6 nucleotides. If a tag probe (TP) is to hybridize to the 
integrated tag sequence of a family chain-terminated 
fragments, its preferential length is about 20 nucleotides. 40 

The table in FIG. 7 depicts some examples of mass- 
modified primer/tag probe configurations for DNA, as well 
as RNA Sanger sequencing. This list is, however, not meant 
to be limiting, since numerous other combinations of mass- 
modifying functions and positions within the oligonucle- 45 
otide molecule are possible and are deemed part of the 
invention. The mass-modifying functionality can be, for 
example, a halogen, an azido, or of the type, XR, wherein X 
is a linking group and R is a mass-modifying functionality. 
The mass-modifying functionality can thus be used to intro- 50 
duce defined mass increments into the oligonucleotide mol- 
ecule. 

In another embodiment, the nucleotides used for chain- 
elongation and/or termination are mass-modified. Examples 
of such modified nucleotides are shown in FIG. 8. Here the 55 
mass -modifying moiety, M, can be attached either to the 
nucleobase, M 2 (in case of the c 7 -deazanucleosides also to 
C-7, M 7 ), to the triphosphate group at the alpha phosphate, 
M 3 , or to the 2'-position of the sugar ring of the nucleoside 
triphosphate, M 4 and M 6 . Furthermore, the mass-modifying 60 
functionality can be added so as to affect chain termination, 
such as by attaching it to the 3'-position of the sugar ring in 
the nucleoside triphosphate, M 5 . The list in FIG. 8 represents 
examples of possible configurations for generating chain- 
terminating nucleoside triphosphates for RNA or DNA 65 
Sanger sequencing. For those skilled in the art, however, it 
is clear that many other combinations can serve the purpose 


of the invention equally well. In the same way, those skilled 
in the art will recognize that chain-elongating nucleoside 
triphosphates can also be mass-modified in a similar fashion 
with numerous variations and combinations in functionality 
and attachment positions. 

Without limiting the scope of the invention, FIG. 9 gives 
a more detailed description of particular examples of how 
the mass-modification, M, can be introduced for X in XR as 
well as using oligo-/polyethylene glycol derivatives for R. 
The mass-modifying increment in this case is 44, i.e. five 
different mass-modified species can be generated by just 
changing m from 0 to 4 thus adding mass units of 45 (m=0), 
89 (m=l), 133 (m=2), 177 (m=3) and 221 (m-4) to the 
nucleic acid primer (UP), the tag probe (TP) or the nucleo- 
side triphosphates respectively. The oligo/polyethylene gly- 
cols can also be monoalkylated by a lower alkyl such as 
methyl, ethyl, propyl, isopropyl, t-butyl and the like. A 
selection of linking functionalities, X, are also illustrated. 
Other chemistries can be used in the mass-modified 
compounds, as for example, those described recently in 
Oligonucleotides and Analogues, A Practical Approach, F. 
Eckstein, editor, IRL Press, Oxford, 1991. 

In yet another embodiment, various mass-modifying 
functionalities, R, other than oligo/polyethylene glycols, can 
be selected and attached via appropriate linking chemistries, 
X. Without any limitation, some examples are given in FIG. 
10. A simple mass-modification can be achieved by substi- 
tuting H for halogens like F, CI, Br and/or I, or pseudohalo- 
gens such as SCN, NCS, or by using different alkyl, aryl or 
aralkyl moieties such as methyl, ethyl, propyl, isopropyl, 
t-butyl, hexyl, phenyl, substituted phenyl, benzyl, or func- 
tional groups such as CH 2 F, CHF 2 , CF 3 , Si(CH 3 ) 3 , Si(CH 3 ) 
2 (C 2 H 5 ), Si(CH 3 )(C 2 H 5 ) 2 , Si(C 2 F£ 5 ) 3 . Yet another mass- 
modification can be obtained by attaching homo- or 
heteropeptides through X to the UP, TP or nucleoside 
triphosphates. One example useful in generating mass- 
modified species with a mass increment of 57 is the attach- 
ment of oligoglycines, e.g., mass-modifications of 74 (r=l, 
m=0), 131 (r=l, m=2), 188 (r=l, m=3), 245 (r=l, m=4) are 
achieved. Simple oligoamides also can be used, e.g., mass- 
modifications of 74 (r=l, m=0), 88 (r-2, m=0), 102 (r=3, 
m=0), 116 (r=4, m=0), etc. are obtainable. For those skilled 
in the art, it will be obvious that there are numerous 
possibilities in addition to those given in FIG. 10 and the 
above mentioned reference (Oligonucleotides and 
Analogues, F. Eckstein, 1991), for introducing, in a prede- 
termined manner, many different mass-modifying function- 
alities to UP, TP and nucleoside triphosphates which are 
acceptable for DNA and RNA Sanger sequencing. 

As used herein, the superscript 0-i designates i+1 mass 
differentiated nucleotides, primers or tags. In some 
instances, the superscript 0 (e.g., NTP°, UP 0 ) can designate 
an unmodified species of a particular reactant, and the 
superscript i (e.g., NTP 1 , NTP 1 , NTP 2 , etc.) can designate 
the i-th mass-modified species of that reactant. If, for 
example, more than one species of nucleic acids (e.g., DNA 
clones) are to be concurrently sequenced by multiplex DNA 
sequencing, then i+1 different mass-modified nucleic acid 
primers (UP°, UP 1 , . . . UP C ) can be used to distinguish each 
set of base -specifically terminated fragments, wherein each 
species of mass-modified UP' can be distinguished by mass 
spectrometry from the rest. 

As illustrative embodiments of this invention, three dif- 
ferent basic processes for multiplex mass spectrometric 
DNA sequencing employing the described mass-modified 
reagents are described below: 

A) Multiplexing by the use of mass-modified nucleic acid 
primers (UP) for Sanger DNA or RNA sequencing (see 
for example FIG. 11); 
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B) Multiplexing by the use of mass-modified nucleoside 
triphosphates as chain elongators and/or chain termi- 
nators for Sanger DNA or RNA sequencing (see for 
example FIG. 12); and 

C) Multiplexing by the use of tag probes which specifi- 5 
cally hybridize to tag sequences which are integrated 
into part of the four Sanger DNA/RNA base- 
specifically terminated fragment families. Mass modi- 
fication here can be achieved as described for FIGS. 7, 

9 and 10, or alternately, by designing different oligo- 10 
nucleotide sequences having the same or different 
length with unmodified nucleotides which, in a prede- 
termined way, generate appropriately differentiated 
molecular weights (see for example FIG. 13). 
The process of multiplexing by mass-modified nucleic 15 
acid primers (UP) is illustrated by way of example in FIG. 
11 for mass analyzing four different DNA clones simulta- 
neously. The first reaction mixture is obtained by standard 
Sanger DNA sequencing having unknown DNA fragment 1 
(clone 1) integrated in an appropriate vector (e.g., 20 
M13mpl8), employing an unmodified nucleic acid primer 
UP°, and a standard mixture of the four unmodified deoxy- 
nucleoside triphosphates, dNTP 0 , and with Vioth of one of 
the four dideoxynucleoside triphosphates, ddNTP 0 . A sec- 
ond reaction mixture for DNA fragment 2 (clone 2) is 25 
obtained by employing a mass-modified nucleic acid primer 
UP 1 and, as before, the four unmodified nucleoside 
triphosphates, dNTP 0 , containing in each separate Sanger 
reaction Vioth of the chain-terminating unmodified dideoxy- 
nucleoside triphosphates ddNTP°. In the other two 30 
experiments, the four Sanger reactions have the following 
compositions: DNA fragment 3 (clone 3), UP 2 , dNTP 0 , 
ddNTP° and DNA fragment 4 (clone 4), UP 3 , dNTP 0 , 
ddNTP 0 . For mass spectrometric DNA sequencing, all base- 
specifically terminated reactions of the four clones are 35 
pooled and mass analyzed. The various mass peaks belong- 
ing to the four dideoxy-terminated (e.g., ddT-terminated) 
fragment families are assigned to specifically elongated and 
ddT-terminated fragments by searching (such as by a com- 
puter program) for the known molecular ion peaks of UP°, 40 
UP 1 , UP 2 and UP 3 extended by either one of the four 
dideoxynucleoside triphosphates, UP°-ddN°, UP^ddN 0 , 
UP 2 -ddN° and UP^ddN 0 . In this way, the first nucleotides 
of the four unknown DNA sequences of clone 1 to 4 are 
determined. The process is repeated, having memorized the 45 
molecular masses of the four specific first extension 
products, until the four sequences are assigned. Unambigu- 
ous mass/sequence assignments are possible even in the 
worst case scenario in which the four mass-modified nucleic 
acid primers are extended by the same dideoxynucleoside 50 
triphosphate, the extension products then being, for 
example, UP°-ddT, UP x -ddT, UP 2 -ddT and UP 3 -ddT, which 
differ by the known mass increment differentiating the four 
nucleic acid primers. In another embodiment of this 
invention, an analogous technique is employed using dif- 55 
ferent vectors containing, for example, the SP6 and/or T7 
promoter sequences, and performing transcription with the 
nucleic acid primers UP°, UP 1 , UP 2 and UP 3 and either an 
RNA polymerase (e.g., SP6 or T7 RNA polymerase) with 
chain-elongating and terminating unmodified nucleoside 60 
triphosphates NTP° and 3'-dNTP°. Here, the DNA sequence 
is being determined by Sanger RNA sequencing. 

FIG. 12 illustrates the process of multiplexing by mass- 
modified chain-elongating or/and terminating nucleoside 
triphosphates in which three different DNA fragments (3 65 
clones) are mass analyzed simultaneously. The first DNA 
Sanger sequencing reaction (DNA fragment 1, clone 1) is the 
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standard mixture employing unmodified nucleic acid primer 
UP 0 , dNTP 0 and in each of the four reactions one of the four 
ddNTP 0 . The second (DNA fragment 2, clone 2) and the 
third (DNA fragment 3, clone 3) have the following con- 
tents: UP 0 , dNTP 0 , ddNTP 1 and UP 0 , dNTP 0 , ddNTP 2 , 
respectively. In a variation of this process, an amplification 
of the mass increment in mass-modifying the extended DNA 
fragments can be achieved by either using an equally 
mass-modified deoxynucleoside triphosphate (i.e., dNTP 1 , 
dNTP 2 ) for chain elongation alone or in conjunction with the 
homologous equally mass-modified dideoxynucleoside 
triphosphate. For the three clones depicted above, the con- 
tents of the reaction mixtures can be as follows: either 
UP°/dNTP 0 /ddNTP 0 ,UP°/dNTP 1 /ddNTP 0 and UP°/dNTP 2 / 
ddNTP 0 or UP°/dNTP°/ddNTP°, UP°/dNTP 1 /ddNTP 1 and 
UP°/dNTP 2 /ddNTP 2 . As described above, DNA sequencing 
can be performed by Sanger RNA sequencing employing 
unmodified nucleic acid primers, UP 0 , and an appropriate 
mixture of chain-elongating and terminating nucleoside 
triphosphates. The mass-modification can be again either in 
the chain-terminating nucleoside triphosphate alone or in 
conjunction with mass-modified chain-elongating nucleo- 
side triphosphates. Multiplexing is achieved by pooling the 
three base -specifically terminated sequencing reactions 
(e.g., the ddTTP terminated products) and simultaneously 
analyzing the pooled products by mass spectrometry. Again, 
the first extension products of the known nucleic acid primer 
sequence are assigned, e.g., via a computer program. Mass/ 
sequence assignments are possible even in the worst case in 
which the nucleic acid primer is extended/terminated by the 
same nucleotide, e.g., ddT, in all three clones. The following 
configurations thus obtained can be well differentiated by 
their different mass-modifications: UP°-ddT°, UP°-ddT 1 , 
UP°-ddT 2 . 

In yet another embodiment of this invention, DNA 
sequencing by multiplex mass spectrometry can be achieved 
by cloning the DNA fragments to be sequenced in "plex- 
vectors" containing vector specific "tag sequences" as 
described (Koster et al., "Oligonucleotide Synthesis and 
Multiplex DNA Sequencing Using Chemiluminescent 
Detection," Nucleic Acids Res. Symposium Ser. No. 24, 
318-321 (1991)); then pooling clones from different plex- 
vectors for DNA preparation and the four separate Sanger 
sequencing reactions using standard dNTP°/ddNTP° and 
nucleic acid primer UP 0 ; purifying the four multiplex frag- 
ment families via linking to a solid support through the 
linking group, L, at the 5 1 -end of UP; washing out all 
by-products, and cleaving the purified multiplex DNA frag- 
ments off the support or using the L-L' bound nested Sanger 
fragments as such for mass spectrometric analysis as 
described above; performing de-multiplexing by one-by-one 
hybridization of specific "tag probes"; and subsequendy 
analyzing by mass spectrometry (see, for example, FIG. 13). 
As a reference point, the four base-specifically terminated 
multiplex DNA fragment families are run by the mass 
spectrometer and all ddT 0 -, ddA°-, ddC°- and ddG°- 
terminated molecular ion peaks are respectively detected 
and memorized. Assignment of, for example, ddT°- 
terminated DNA fragments to a specific fragment family is 
accomplished by another mass spectrometric analysis after 
hybridization of the specific tag probe (TP) to the corre- 
sponding tag sequence contained in the sequence of this 
specific fragment family. Only those molecular ion peaks 
which are capable of hybridizing to the specific tag probe are 
shifted to a higher molecular mass by the same known mass 
increment (e.g. of the tag probe). These shifted ion peaks, by 
virtue of all hybridizing to a specific tag probe, belong to the 
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same fragment family. For a given fragment family, this is 
repeated for the remaining chain terminated fragment fami- 
lies with the same tag probe to assign the complete DNA 
sequence. This process is repeated i-1 times corresponding 
to i clones multiplexed (the i-th clone is identified by 5 
default). 

The differentiation of the tag probes for the different 
multiplexed clones can be obtained just by the DNA 
sequence and its ability to Watson-Crick base pair to the tag 
sequence. It is well known in the art how to calculate 10 
stringency conditions to provide for specific hybridization of 
a given tag probe with a given tag sequence (see, for 
example, Molecular Cloning: A laboratory manual 2ed, ed. 
by Sambrook, Fritsch and Maniatis (Cold Spring Harbor 
Laboratory Press: NY, 1989, Chapter 11). Furthermore, 15 
differentiation can be obtained by designing the tag 
sequence for each pie x- vector to have a sufficient mass 
difference so as to be unique just by changing the length or 
base composition or by mass-modifications according to 
FIGS. 7, 9 and 10. In order to keep the duplex between the 20 
tag sequence and the tag probe intact during mass spectro- 
metric analysis, it is another embodiment of the invention to 
provide for a covalent attachment mediated by, for example, 
photoreactive groups such as psoralen and ellipticine and by 
other methods known to those skilled in the art (see, for 25 
example, Helene et al., Nature 344, 358 (1990) and Thuong 
et al. "Oligonucleotides Attached to Intercalators, Photore- 
active and Cleavage Agents" in F. Eckstein, Oligonucle- 
otides and Analogues: A Practical Approach, IRL Press, 
Oxford 1991, 283-306). 30 

The DNA sequence is unraveled again by searching for 
the lowest molecular weight molecular ion peak correspond- 
ing to the known UP°-tag sequence/tag probe molecular 
weight plus the first extension product, e.g., ddT°, then the 
second, the third, etc. 35 

In a combination of the latter approach with the previ- 
ously described multiplexing processes, a further increase in 
multiplexing can be achieved by using, in addition to the tag 
probe/tag sequence interaction, mass-modified nucleic acid 
primers (FIG. 7) and/or mass-modified deoxynucleoside, 40 
dNTP 0 " 1 , and/or dideoxymicleoside triphosphates, ddNTP 0 "'. 
Those skilled in the art will realize that the tag sequence/tag 
probe multiplexing approach is not limited to Sanger DNA 
sequencing generating nested DNA fragments with DNA 
polymerases. The DNA sequence can also be determined by 45 
transcribing the unknown DNA sequence from appropriate 
promoter-containing vectors (see above) with various RNA 
polymerases and mixtures of NTP 0 "73'-dNTP°"\ thus gen- 
erating nested RNA fragments. 

In yet another embodiment of this invention, the mass- 50 
modifying functionality can be introduced by a two or 
multiple step process. In this case, the nucleic acid primer, 
the chain-elongating or terminating nucleoside triphosphates 
and/or the tag probes are, in a first step, modified by a 
precursor functionality such as azido, — N 3 , or modified 55 
with a functional group in which the R in XR is H (FIG. 7, 
9) thus providing temporary functions, e.g., but not limited 
to —OH, —NIL, — NHR, — SI I, — NCS, — OCOiCH^ 
COOH (r-1-20), -NHCO(CH 2 ) r COOH (r-1-20), 
— 0S0 2 0H, — OCO(CH 2 )J (r=l-20), — OP(0-Alkyl)N 60 
(Alkyl) 2 . These less bulky functionalities result in better 
substrate properties for the enzymatic DNA or RNA syn- 
thesis reactions of the DNA sequencing process. The appro- 
priate mass-modifying functionality is then introduced after 
the generation of the nested base -specifically terminated 65 
DNA or RNA fragments prior to mass spectrometry. Several 
examples of compounds which can serve as mass-modifying 
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functionalities are depicted in FIGS. 9 and 10 without 
limiting the scope of this invention. 

Another aspect of this invention concerns kits for 
sequencing nucleic acids by mass spectrometry which 
include combinations of the above-described sequencing 
reactants. For instance, in one embodiment, the kit com- 
prises reactants for multiplex mass spectrometric sequenc- 
ing of several different species of nucleic acid. The kit can 
include a solid support having a linking functionality (L 1 ) 
for immobilization of the base -specifically terminated prod- 
ucts; at least one nucleic acid primer having a linking group 
(L) for reversibly and temporarily linking the primer and 
solid support through, for example, a photocleavable bond; 
a set of chain-elongating nucleotides (e.g., dATP, dCTP, 
dGTP and dTTP, or ATP, CTP, GTP and UTP); a set of 
chain-terminating nucleotides (such as 2 f ,3'- 
dideoxynucleotides for DNA synthesis or 
3'-deoxynucleotides for RNA synthesis); and an appropriate 
polymerase for synthesizing complementary nucleotides. 
Primers and/or terminating nucleotides can be mass- 
modified so that the base-specifically terminated fragments 
generated from one of the species of nucleic acids to be 
sequenced can be distinguished by mass spectrometry from 
all of the others. Alternative to the use of mass-modified 
synthesis reactants, a set of tag probes (as described above) 
can be included in the kit. The kit can also include appro- 
priate buffers as well as instructions for performing multi- 
plex mass spectrometry to concurrently sequence multiple 
species of nucleic acids. 

In another embodiment, a nucleic acid sequencing kit can 
comprise a solid support as described above, a primer for 
initiating synthesis of complementary nucleic acid 
fragments, a set of chain-elongating nucleotides and an 
appropriate polymerase. The mass-modified chain- 
terminating nucleotides are selected so that the addition of 
one of the chain terminators to a growing complementary 
nucleic acid can be distinguished by mass spectrometry. 

The present invention is further illustrated by the follow- 
ing examples which should not be construed as limiting in 
any way. The contents of all cited references (including 
literature references, issued patents, published patent appli- 
cations (including international patent application Publica- 
tion Number WO 94/16101, entitled "DNA Sequencing by 
Mass Spectrometry" by H. Koester; and international patent 
application Publication Number WO 94/21822 entitled 
"DNA Sequencing by Mass Spectrometry "Via Exonuclease 
Degradation" by H. Koester), and co-pending patent 
applications, (including U.S patent application Ser. No. 
08/406,199, entitled "DNA Diagnostics Based on Mass 
Spectrometry" by H. Koester), as cited throughout this 
application are hereby expressly incorporated by reference. 

EXAMPLE 1 

Immobilization of Primer-extension Products of Sanger 
DNA Sequencing Reaction for Mass Spectrometric Analysis 
via Disulfide Bonds 

As a solid support, Sequelon membranes (Millipore 
Corp., Bedford, Mass.) with phenyl isothiocyanate groups 
are used as a starting material. The membrane disks, with a 
diameter of 8 mm, are wetted with a solution of 
N-methylmorpholine/water/2-propanol (NMM solution) 
(2/49/49 v/v/v), the excess liquid removed with filter paper 
and placed on a piece of plastic film or aluminum foil 
located on a heating block set to 55° C. A solution of 1 mM 
2-mercaptoethylamine (cysteamine) or 2,2'-dithio-bis 
(ethylamine) (cystamine) or S-(2-thiopyridyl)-2-thio- 
ethyl amine (10 ul, 10 nmol) in NMM is added per disk and 
heated at 55° C. After 15 min, 10 ul of NMM solution are 
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added per disk and heated for another 5 min. Excess of 
isothiocyanate groups may be removed by treatment with 10 
ul of a 10 mM solution of glycine in NMM solution. For 
cystamine, the disks are treated with 10 ul of a solution of 
1 M aqueous dithiothreitol (DTT)/2-propanol (1:1 v/v) for 
15 min at room temperature. Then, the disks are thoroughly 
washed in a filtration manifold with 5 aliquots of 1 ml each 
of the NMM solution, then with 5 aliquots of 1 ml 
acetonitrile/water (1/1 v/v) and subsequently dried. If not 
used immediately the disks are stored with free thiol groups 
in a solution of 1 M aqueous dithiothreitol/2-propanol (1:1 
v/v) and, before use, DTT is removed by three washings 
with 1 ml each of the NMM solution. The primer oligo- 
nucleotides with 5'-SH functionality can be prepared by 
various methods (e.g., B. C. F Chu el al., Nucleic Acids Res. 
14, 5591-5603 (1986), Sproat et al., Nucleic Acids R es. 15, 
4837^48 (1987) and Oligonucleotides and Analogues: A 
Practical Approach (F. Eckstein, editor), IRL Press Oxford, 
1991). Sequencing reactions according to the Sanger proto- 
col are performed in a standard way (e.g., H. Swerdlow et 
al., Nucleic Acids Res. 18, 1415-19 (1990)). In the presence 
of about 7-10 mM DTT the free 5 -thiol primer can be used; 
in other cases, the SH functionality can be protected, e.g., by 
a trityl group during the Sanger sequencing reactions and 
removed prior to anchoring to the support in the following 
way. The four sequencing reactions (150 ul each in an 
Eppendorf tube) are terminated by a 10 min incubation at 
70° C. to denature the DNA polymerase (such as Klenow 
fragment, Sequenase) and the reaction mixtures are ethanol 
precipitated. The supernatants are removed and the pellets 
vortexed with 25 ul of an 1 M aqueous silver nitrate solution, 
and after one hour at room temperature, 50 ul of an 1 M 
aqueous solution of DTT is added and mixed by vortexing. 
After 15 min, the mixtures are centrifuged and the pellets are 
washed twice with 100 ul ethylacetate by vortexing and 
centrifugation to remove excess DTT The primer extension 
products with free 5' -thiol group are now coupled to the 
thiolated membrane supports under mild oxidizing condi- 
tions. In general, it is sufficient to add the 5'-thiolated primer 
extension products dissolved in 10 ul 10 mM de-aerated 
triethylammonium acetate buffer (TEAA) pH 7.2 to the 
thiolated membrane supports. Coupling is achieved by dry- 
ing the samples onto the membrane disks with a cold fan. 
This process can be repeated by wetting the membrane with 
10 ul of 10 mM TEAA buffer pH 7.2 and drying as before. 
When using the 2-thiopyridyl derivatized compounds, 
anchoring can be monitored by the release of pyridine -2- 
thione spectrophotometrically at 343 nm. 

In another variation of this approach, the oligonucleotide 
primer is functionalized with an amino group at the 5'-end 
which is introduced by standard procedures during auto- 
mated DNA synthesis. After primer extension, during the 
Sanger sequencing process, the primary amino group is 
reacted with 3-(2-pyridyldithio) propionic acid 
N-hydroxysuccinimide ester (SPDP) and subsequently 
coupled to the thiolated supports and monitored by the 
release of pyridyl-2-thione as described above. After dena- 
turation of DNA polymerase and ethanol precipitation of the 
sequencing products, the supernatants are removed and the 
pellets dissolved in 10 ul 10 mM TEAA buffer pH 7.2 and 
10 ul of a 2 mM solution of SPDP in 10 mM TEAA are 
added. The reaction mixture is vortexed and incubated for 30 
min at 25° C. Excess SPDP is then removed by three 
extractions (vortexing, centrifugation) with 50 ul each of 
ethanol and the resulting pellets are dissolved in 10 ul 10 
mM TEAA buffer pH 7.2 and coupled to the thiolated 
supports (see above). 
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The primer-extension products are purified by washing 
the membrane disks three times each with 100 ul NMM 
solution and three times with 100 ul each of 10 mM TEAA 
buffer pH 7.2. The purified primer -extension products are 
5 released by three successive treatments with 10 ul of 10 mM 
2-mercaptoethanol in 10 mM TEAA buffer pH 7.2, lyo- 
philized and analyzed by either ES or MALDI mass spec- 
trometry. 

This procedure can also be used for the mass-modified 
10 nucleic acid primers UP°" ! in an analogous and appropriate 
way, taking into account the chemical properties of the 
mass-modifying functionalities. 

EXAMPLE 2 

i5 Immobilization of Primer-extension Products of Sanger 
DNA Sequencing Reaction for Mass Spectrometric Analysis 
via the Levulinyl Group 

5-Aminolevulinic acid is protected at the primary amino 
group with the Fmoc group using 9-fluorenylmethyl 

2o N-succinimidyl carbonate and is then transformed into the 
N-hydroxysuccinimide ester (NHS ester) using 
N-hydroxysuccinimide and dicyclohexyl carbodiimide 
under standard conditions. For the Sanger sequencing 
reactions, nucleic acid primers, UP 0 " 1 , are used which are 
functionalized with a primary amino group at the 5'-end 
introduced by standard procedures during automated DNA 
synthesis with aminolinker phosphoamidites as the final 
synthetic step. Sanger sequencing is performed under stan- 
dard conditions (see above). The four reaction mixtures (150 

3o ul each in an Eppendorf tube) are heated to 70° C. for 10 min 
to inactivate the DNA polymerase, ethanol precipitated, 
centrifuged and resuspended in 10 ul of 10 mM TEAA buffer 
pH 7.2. 10 ul of a 2 mM solution of the Fmoc-5- 
aminolevuhnyl-NHS ester in 10 mM TEAA buffer is added, 

35 vortexed and incubated at 25° C. for 30 min. The excess of 
the reagent is removed by ethanol precipitation and cen- 
trifugation. The Fmoc group is cleaved off by resuspending 
the pellets in 10 ul of a solution of 20% piperidine in 
N,N-dimethylformamide/water (1:1 v/v). After 15 min at 
25° C.j piperidine is thoroughly removed by three 
precipitations/centrifugations with 100 ul each of ethanol, 
the pellets are resuspended in 10 ul of a solution of 
N-methylmorpholine, 2-propanol and water (2/10/88 v/v/v) 
and are coupled to the solid support carrying an isothiocy- 

^ 5 anate group. In the case of the DITC-Sequelon membrane 
(Millipore Corp., Bedford, Mass.), the membranes are pre- 
pared as described in EXAMPLE 1 and coupling is achieved 
on a heating block at 55° C. as described above. RNA 
extension products are immobilized in an analogous way. 

5o The procedure can be applied to other solid supports with 
isothiocyanate groups in a similar manner. 

The immobilized primer-extension products are exten- 
sively washed three times with 100 ul each of NMM solution 
and three times with 100 ul 10 mM TEAA buffer pH 7.2. The 
purified primer-extension products are released by three 
successive treatments with 10 ul of 100 mM hydrazinium 
acetate buffer pH 6.5, lyophilized and analyzed by either ES 
or MALDI mass spectrometry. 

EXAMPLE 3 

60 Immobilization of Primer-extension Products of Sanger 
DNA Sequencing Reaction for Mass Spectrometric Analysis 
via a Trypsin Sensitive Linkage 

Sequelon DITC membrane disks of 8 mm diameter 
(Millipore Corp., Bedford, Mass.) are wetted with 10 ul of 

65 NMM solution (N-methylmorp ho line/prop anaol- 2/water; 
2/49/49 v/v/v) and a linker arm introduced by reaction with 
10 ul of a 10 mM solution of 1,6-diaminohexane in NMM. 
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The excess diamine is removed by three washing steps with 
100 ul of NMM solution. Using standard peptide synthesis 
protocols, two L-lysine residues are attached by two suc- 
cessive condensations with N-Fmoc-N-tBoc-L-lysine 
pentafluorophenylester, the terminal Fmoc group is removed 
with piperidine in NMM and the free a-amino group 
coupled to 1,4-phenylene diisothiocyanate (DITC). Excess 
DITC is removed by three washing steps with 100 ul 
2-propanol and the N-tBoc groups removed with trifluoro- 
acetic acid according to standard peptide synthesis proce- 
dures. The nucleic acid primer-extension products are pre- 
pared from oligonucleotides which carry a primary amino 
group at the S'-terminus. The four Sanger DNA sequencing 
reaction mixtures (150 ul each in Eppendorf tubes) are 
heated for 10 min at 70° C. to inactivate the DNA 
polymerase, ethanol precipitated, and the pellets resus- 
pended in 10 ul of a solution of N-methylmorpholine, 
2-propanol and water (2/10/88 v/v/v). This solution is trans- 
ferred to the Lys-Lys-DITC membrane disks and coupled on 
a heating block set at 55° C. After drying, 10 ul of NMM 
solution is added and the drying process repeated. 

The immobilized primer-extension products are exten- 
sively washed three times with 100 ul each of NMM solution 
and three times with 100 ul each of 10 mM TEAA buffer pH 
7.2. For mass spectrometric analysis, the bond between the 
primer-extension products and the solid support is cleaved 
by treatment with trypsin under standard conditions and the 
released products analyzed by either ES or MALDI mass 
spectrometry with trypsin serving as an internal mass stan- 
dard. 

EXAMPLE 4 

Immobilization of Primer-extension Products of Sanger 
DNA Sequencing Reaction for Mass Spectrometric Analysis 
via Pyrophosphate Linkage 

The DITC Sequelon membrane (disks of 8 mm diameter) 
are prepared as described in EXAMPLE 3 and 10 ul of a 10 
mM solution of 3-aminopyridine adenine dinucleotide 
(APAD) (Sigma) in NMM solution added. The excess APAD 
is removed by a 10 ul wash of NMM solution and the disks 
are treated with 10 ul of 10 mM sodium periodate in NMM 
solution (15 min, 250° C). Excess periodate is removed and 
the primer-extension products of the four Sanger DNA 
sequencing reactions (150 ul each in Eppendorf rubes) 
employing nucleic acid primers with a primary amino group 
at the 5 '-end are ethanol precipitated, dissolved in 10 ul of 
a solution of N-methylmorpholine/2-propanol/water (2/10/ 
88 v/v/v) and coupled to the 2'3'-dialdehydo groups of the 
immobilized NAD analog. 

The primer-extension products are extensively washed 
with the NMM solution (3 times with 100 ul each) and 10 
mM TEAA buffer pH 7.2 (3 times with 100 ul each) and the 
purified primer-extension products are released by treatment 
with either NADase or pyrophosphatase in 10 mM TEAA 
buffer at pH 7.2 at 37° C. for 15 min, lyophilized and 
analyzed by either ES or MALDI mass spectrometry, the 
enzymes serving as internal mass standards. 

EXAMPLE 5 

Synthesis of Nucleic Acid Primers Mass-modified by Gly- 
cine Residues at the 5'-position of the Sugar Moiety of the 
Terminal Nucleoside 

Oligonucleotides are synthesized by standard automated 
DNA synthesis using P-cyanoethylphosphoamidites (H. K 
oster et al., Nucleic Acids Res. 12, 4539 (1984)) and a 
5' -amino group is introduced at the end of solid phase DNA 
synthesis (e.g. Agrawal et al., Nucleic Acids Res. 14, 
6227^15 (1986) or Sproat et al., Nucleic Acids Res. 15, 
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6181-96 (1987)). The total amount of an oligonucleotide 
synthesis, starting with 0.25 umol CPG-bound nucleoside, is 
deprotected with concentrated aqueous ammonia, purified 
via OligoPAK™ Cartridges (Millipore Corp., Bedford, 

5 Mass.) and lyophilized. This material with a 5'-terminal 
amino group is dissolved in 100 ul absolute N,N- 
dimethylformamide (DMF) and condensed with 10 //mole 
N-Fmoc-glycine pentafluorophenyl ester for 60 min at 25° 
C. After ethanol precipitation and centrifugation, the Fmoc 

ID group is cleaved off by a 10 min treatment with 100 ul of a 
solution of 20% piperidine in N,N-dimethylformamide. 
Excess piperidine, DMF and the cleavage product from the 
Fmoc group are removed by ethanol precipitation and the 
precipitate lyophilized from 10 mM TEAA buffer pH 7.2. 

15 This material is now either used as primer for the Sanger 
DNA sequencing reactions or one or more glycine residues 
(or other suitable protected amino acid active esters) are 
added to create a series of mass-modified primer oligonucle- 
otides suitable for Sanger DNA or RNA sequencing. Immo- 

20 bilization of these mass-modified nucleic acid primers Up 0 i 
after primer-extension during the sequencing process can be 
achieved as described, e.g., in EXAMPLES 1 to 4. 

EXAMPLE 6 

Synthesis of Nucleic Acid Primers Mass-modified at C-5 of 
the Heterocyclic Base of a Pyrimidine Nucleoside with 
Glycine Residues 

Starting material was 5-(3-aminopropynyl-l)-3'5'-di-p- 
tolyldeoxyuridine prepared and 3'5'-de-0-acylated accord- 
ing to literature procedures (Haralambidis et al., Nucleic 

30 Acids Res. 15, 4857-76 (1987)). 0.281 g (1.0 mmole) 
5-(3-aminopropynyl-l)-2'-deoxyuridine were reacted with 
0.927 g (2.0 mmole) N-Fmoc-glycine pentafluoropheny- 
lester in 5 ml absolute N,N-dimethylformamide in the pres- 

35 ence of 0.129 g (1 mmole; 174 ul) N,N- 
diisopropylethylamine for 60 min at room temperature. 
Solvents were removed by rotary evaporation and the prod- 
uct was purified by silica gel chromatography (Kieselgel 60, 
Merck; column: 2.5x50 cm, elution with chloroform/ 

^ o methanol mixtures). Yield was 0.44 g (0.78 mmole, 78%). In 
order to add another glycine residue, the Fmoc group is 
removed with a 20 min treatment with 20% solution of 
piperidine in DMF, evaporated in vacuo and the remaining 
solid material extracted three times with 20 ml ethylacetate. 

^ 5 After having removed the remaining ethylacetate, N-Fmoc- 
glycine pentafLuorophenylester is coupled as described 
above. 5-(3-(N-Fmoc-glycyl)-amidopropynyl-l)-2'- 
deoxyuridine is transformed into the 5'-0- 
dimethoxytritylated nucleoside-3'-0-p-cyanoethyl-N,N- 

5Q diisopropylphosphoamidite and incorporated into automated 
oligonucleotide synthesis by standard procedures (H. Koster 
et al., Nucleic Acids Res. 12, 2261 (1984)). This glycine 
modified thymidine analogue building block for chemical 
DNA synthesis can be used to substitute one or more of the 

55 thymidine/uridine nucleotides in the nucleic acid primer 
sequence. The Fmoc group is removed at the end of the solid 
phase synthesis with a 20 min treatment with a 20% solution 
of piperidine in DMF at room temperature. DMF is removed 
by a washing step with acetonitrile and the oligonucleotide 

g0 deprotected and purified in the standard way. 

EXAMPLE 7 

Synthesis of a Nucleic Acid Primer Mass-modified at C-5 of 
the Heterocyclic Base of a Pyrimidine Nucleoside with 
[3- alanine Residues 
65 Starting material was the same as in EXAMPLE 6. 0.281 
g (1.0 mmole) 5-(3-Aminopropynyl-l)-2 , -deoxyuridine was 
reacted with N-Fmoc-P -alanine pentafluorophenylester 
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(0.955 g, 2.0 mmole) in 5 ml N,N-dimethylformamide 
(DMF) in the presence of 0.129 g (174 ul; 1.0 mmole) 
N,N-disopropylethylamine for 60 min at room temperature. 
Solvents were removed and the product purified by silica gel 
chromatography as described in EXAMPLE 6. Yield was 
0.425 g (0.74 mmole, 74%). Another [3-alamne moiety can 
be added in exactly the same way after removal of the Fmoc 
group. The preparation of the 5'-0-dimethoxytritylated 
nucleoside-3'-0-P-cyanoethyl-N,N- 
diisopropylphosphoamidite from 5-(3-(N-Fmoc-(3-alanyl)- 
amidopropynyl-l)-2 , -deoxyuridine and incorporation into 
automated oligonucleotide synthesis is performed under 
standard conditions. This building block can substitute for 
any of the thymidine/uridine residues in the nucleic acid 
primer sequence. In the case of only one incorporated 
mass-modified nucleotide, the nucleic acid primer molecules 
prepared according to EXAMPLES 6 and 7 would have a 
mass difference of 14 daltons. 

EXAMPLE 8 

Synthesis of a Nucleic Acid Primer Mass-modified at C-5 of 
the Heterocyclic Base of a Pyrimidine Nucleoside with 
Ethylene Glycol Monomethyl Ether 

As a nucleosidic component, 5-(3-aminopropynyl-l)-2'- 
deoxyuridine was used in this example (see EXAMPLES 6 
and 7). The mass-modifying functionality was obtained as 
follows: 7.61 g (100.0 mmole) freshly distilled ethylene 
glycol monomethyl ether dissolved in 50 ml absolute pyri- 
dine was reacted with 10.01 g (100.0 mmole) recrystallized 
succinic anhydride in the presence of 1.22 g (10.0 mmole) 
4-N,N-dimethylaminopyridine overnight at room tempera- 
ture. The reaction was terminated by the addition of water 
(5.0 ml), the reaction mixture evaporated in vacuo, 
co-evaporated twice with dry toluene (20 ml each) and the 
residue redissolved in 100 ml dichloromethane. The solution 
was extracted successively, twice with 10% aqueous citric 
acid (2x20 ml) and once with water (20 ml) and the organic 
phase dried over anhydrous sodium sulfate. The organic 
phase was evaporated in vacuo, the residue redissolved in 50 
ml dichloromethane and precipitated into 500 ml pentane 
and the precipitate dried in vacuo. Yield was 13.12 g (74.0 
mmole; 74%). 8.86 g (50.0 mmole) of succinylated ethylene 
glycol monomethyl ether was dissolved in 100 ml dioxane 
containing 5% dry pyridine (5 ml) and 6.96 g (50.0 mmole) 
4-nitrophenol and 10.32 g (50.0 mmole) dicyclohexylcar- 
bodiimide was added and the reaction run at room tempera- 
ture for 4 hours. Dicyclohexylurea was removed by 
filtration, the filtrate evaporated in vacuo and the residue 
redissolved in 50 ml anhydrous DMF. 12.5 ml (about 12.5 
mmole 4-nitrophenylester) of this solution was used to 
dissolve 2.81 g (10.0 mmole) 5-(3-aminopropynyl-l)-2'- 
deoxyuridine. The reaction was performed in the presence of 
1.01 g (10.0 mmole; 1.4 ml) triethylamine at room tempera- 
ture overnight. The reaction mixture was evaporated in 
vacuo, co-evaporated with toluene, redissolved in dichlo- 
romethane and chromatographed on silicagel (Si60, Merck; 
column 4x50 cm) with dichloromethane/methanol mixtures. 
The fractions containing the desired compound were 
collected, evaporated, redissolved in 25 ml dichloromethane 
and precipitated into 250 ml pentane. The dried precipitate 
of 5-(3-N-(0-succinyl ethylene glycol monomethyl ether) 
amidopropynyl-l)-2'-deoxyuridine (yield: 65%) is 5'-0- 
dimethoxytritylated and transformed into the nucleoside -3'- 
0-(3-cyanoethyl-N,N-diisopropylphosphoamidite and 
incorporated as a building block in the automated oligo- 
nucleotide synthesis according to standard procedures. The 
mass-modified nucleotide can substitute for one or more of 
the thymidine/uridine residues in the nucleic acid primer 
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sequence. Deprotection and purification of the primer oli- 
gonucleotide also follows standard procedures. 

EXAMPLE 9 

Synthesis of a Nucleic Acid Primer Mass-modified at C-5 of 

5 the Heterocyclic Base of a Pyrimidine Nucleoside with 
Diethylene Glycol Monomethyl Ether 

Nucleosidic starting material was as in previous 
examples, 5-(3-aminopropynyl-l)-2'-deoxyuridine. The 
mass-modifying functionality was obtained similar to 

10 EXAMPLE 8. 12.02 g (100.0 mmole) freshly distilled 
diethylene glycol monomethyl ether dissolved in 50 ml 
absolute pyridine was reacted with 10.01 g (100.0 mmole) 
recrystallized succinic anhydride in the presence of 1.22 g 
(10.0 mmole) 4-N,N-dimethylaminopyridine (DMAP) over- 

15 night at room temperature. The work-up was as described in 
EXAMPLE 8. Yield was 18.35 g (82.3 mmole, 82.3%). 
11.06 g (50.0 mmole) of succinylated diethylene glycol 
monomethyl ether was transformed into the 
4-nitrophenylester and, subsequently, 12.5 mmole was 

20 reacted with 2.81 g (10.0 mmole) of 5-(3-aminopropynyl- 
l)-2'-deoxyuridine as described in EXAMPLE 8. Yield after 
silica gel column chromatography and precipitation into 
pentane was 3.34 g (6.9 mmole, 69%). After dimethoxytri- 
tylation and transformation into the nucleoside-p- 

25 cyanoethylphosphoamidite, the mass-modified building 
block is incorporated into automated chemical DNA syn- 
thesis according to standard procedures. Within the 
sequence of the nucleic acid primer UP 0 " 1 , one or more of the 
thymidine/uridine residues can be substituted by this mass- 

30 modified nucleotide. In the case of only one incorporated 
mass-modified nucleotide, the nucleic acid primers of 
EXAMPLES 8 and 9 would have a mass difference of 44.05 
daltons. 

35 EXAMPLE 10 

Synthesis of a Nucleic Acid Primer Mass-modified at C-8 of 
the Heterocyclic Base of Deoxyadenosine with Glycine 

Starting material was N e -benzoyl-8-bromo-5'-0-(4,4'- 
dimethoxytrityl)-2 1 -deoxyadenosine prepared according to 
literature (Singh et al., Nucleic Acids Res. 18, 3339^45 
(1990)). 632.5 mg (1.0 mmole) of this 8-bromo- 
deoxyadenosine derivative was suspended in 5 ml absolute 
ethanol and reacted with 251.2 mg (2.0 mmole) glycine 
methyl ester (hydrochloride) in the presence of 241.4 mg 

^ 5 (2.1 mmole; 366 ul) N,N-diisopropylethylamine and 
refluxed until the starting nucleosidic material had disap- 
peared (4-6 hours) as checked by thin layer chromatography 
(TLC). The solvent was evaporated and the residue purified 
by silica gel chromatography (column 2.5x50 cm) using 

5Q solvent mixtures of chloroform/methanol containing 0.1% 
pyridine. The product fractions were combined, the solvent 
evaporated, the fractions dissolved in 5 ml dichloromethane 
and precipitated into 100 ml pentane. Yield was 487 mg 
(0.76 mmole, 76%). Transformation into the corresponding 

55 nucleoside-(3 -cyanoethylphosphoamidite and integration 
into automated chemical DNA synthesis is performed under 
standard conditions. During final deprotection with aqueous 
concentrated ammonia, the methyl group is removed from 
the glycine moiety. The mass-modified building block can 

6Q substitute one or more deoxyadenosine/adenosine residues 
in the nucleic acid primer sequence. 

EXAMPLE 11 

Synthesis of a Nucleic Acid Primer Mass-modified at C-8 of 
the Heterocyclic Base of Deoxyadenosine with Glycylgly- 
65 cine 

This derivative was prepared in analogy to the glycine 
derivative of EXAMPLE 10. 632.5 mg (1.0 mmole) 
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N 6 -Benzoyl-8-bromo-5 , -0-(4,4'-dimethoxytrityl)-2'- 
deoxyadenosine was suspended in 5 ml absolute ethanol and 
reacted with 324.3 mg (2.0 mmole) glycyl-glycine methyl 
ester in the presence of 241.4 mg (2.1 mmole, 366 fid) 
N,N-diisopropylethylamine. The mixture was refluxed and 5 
completeness of the reaction checked by TLC. Work-up and 
purification was similar to that described in EXAMPLE 10. 
Yield after silica gel column chromatography and precipi- 
tation into pentane was 464 mg (0.65 mmole, 65%). Trans- 
formation into the nucleoside-|3-cyanoethylphosphoamidite 10 
and into synthetic oligonucleotides is done according to 
standard procedures. In the case where only one of the 
deoxyadenosine/adenosine residues in the nucleic acid 
primer is substituted by this mass-modified nucleotide, the 
mass difference between the nucleic acid primers of 15 
EXAMPLES 10 and 11 is 57.03 daltons. 

EXAMPLE 12 
Synthesis of a Nucleic Acid Primer Mass-modified at the 
C-2' of the Sugar Moiety of 2'-amino-2 f -deoxythymidine ^ 
with Ethylene Glycol Monomethyl Ether Residues 

Starting material was 5'-0-(4,4-dimethoxytrityl)-2'- 
amino-2'-deoxy thymidine synthesized according to pub- 
lished procedures (e.g., Verheyden et al., J. Org. Chem. 36, 
250-254 (1971), Sasaki et al., J. Org. Chem. 41, 3138-3143 
(1976); Imazawa et al., J. Org. Chem. 44, 2039-2041 25 
(1979); Hobbs et al., J. Org. Chem. 42, 714-719 (1976); 
Ikehara et al., Chem. Pharm. Bull Japan 26, 240-244 
(1978); see also PCT Application WO 88/00201). 5-0-(4, 
4-Dimethoxytrityl)-2'-amino-2'-deoxythymidine (559.62 3Q 
mg; 1.0 mmole) was reacted with 2.0 mmole of the 
4-nitrophenyl ester of succinylated ethylene glycol monom- 
ethyl ether (see EXAMPLE 8) in 10 ml dry DMF in the 
presence of 1.0 mmole (140 ,«1) triethylamine for 18 hours 
at room temperature. The reaction mixture was evaporated 35 
in vacuo, co-evaporated with toluene, redissolved in dichlo- 
romethane and purified by silica gel chromatography (Si60, 
Merck; column: 2.5x50 cm; eluent: chloroform/methanol 
mixtures containing 0.1% triethylamine). The product con- 
taining fractions were combined, evaporated and precipi- 
tated into pentane. Yield was 524 mg (0.73 mmol; 73%). 
Transformation into the nucleoside -f3-cyanoe thy 1-N,N- 
diisopropylphosphoamidite and incorporation into the auto- 
mated chemical DNA synthesis protocol is performed by 
standard procedures. The mass-modified deoxythymidine ^ 
derivative can substitute for one or more of the thymidine 
residues in the nucleic acid primer. 

In an analogous way, by employing the 4-nitrophenyl 
ester of succinylated diethylene glycol monomethyl ether 
(see EXAMPLE 9) and triethylene glycol monomethyl 5Q 
ether, the corresponding mass-modified oligonucleotides are 
prepared. In the case of only one incorporated mass- 
modified nucleoside within the sequence, the mass differ- 
ence between the ethylene, diethylene and triethylene glycol 
derivatives is 44.05, 88.1 and 132.15 daltons respectively. ^ 

EXAMPLE 13 
Synthesis of a Nucleic Acid Primer Mass-modified in the 
Internucleotidic Linkage via Alkylation of Phosphorothioate 
Groups 

Phosphorothioate-containing oligonucleotides were pre- 60 
pared according to standard procedures (see e.g. Gait et al., 
Nucleic Acids Res., 19 1183 (1991)). One, several or all 
intemucleotide linkages can be modified in this way. The 
(-)-M13 nucleic acid primer sequence (17-mer) 
5 ' -dGTAAAACGACGGCCAGT was synthesized in 0.25 65 
//mole scale on a DNA synthesizer and one phosphorothioate 
group introduced after the final synthesis cycle (G to T 


coupling). Sulfurization, deprotection and purification fol- 
lowed standard protocols. Yield was 31.4 nmole (12.6% 
overall yield), corresponding to 31.4 nmole phosphorothio- 
ate groups. Alkylation was performed by dissolving the 
residue in 31.4 fid TE buffer (0.01 M Tris pH 8.0, 0.001 M 
EDTA) and by adding 16 fid of a solution of 20 mM solution 
of 2-iodoethanol (320 nmole; i.e., 10-fold excess with 
respect to phosphorothioate diesters) in N,N- 
dimethylformamide (DMF). The alkylated oligonucleotide 
was purified by standard reversed phase HPLC (RP-18 
Ultraphere, Beckman; column: 4.5x250 mm; 100 mM tri- 
ethylammonium acetate, pH 7.0 and a gradient of 5 to 40% 
acetonitrile). 

In a variation of this procedure, the nucleic acid primer 
containing one or more phosphorothioate phosphodiester 
bond is used in the Sanger sequencing reactions. The primer- 
extension products of the four sequencing reactions are 
purified as exemplified in EXAMPLES 1—4, cleaved off the 
solid support, lyophilized and dissolved in 4 fid each of TE 
buffer pH 8.0 and alkylated by addition of 2 fid of a 20 mM 
solution of 2-iodoethanol in DMF. It is then analyzed by ES 
and/or MALDI mass spectrometry. 

In an analogous way, employing instead of 2-iodoethanol, 
e.g., 3-iodopropanol, 4-iodobutanol mass-modified nucleic 
acid primer are obtained with a mass difference of 14.03, 
28.06 and 42.03 daltons respectively compared to the 
unmodified phosphorothioate phosphodiester-containing 
oligonucleotide. 

EXAMPLE 14 
Synthesis of 2 , -amino-2'-deoxyuridine-5 , -triphosphate and 
3 T -amino-2\3'-dideoxythymidine-5'-triphosphate Mass- 
modified at the 2'- or 3 '-amino Function with Glycine or 
|3- alanine Residues 

Starting material was 2 T -azido-2'-deoxyuridine prepared 
according to literature (Verheyden et al., J. Org. Chem. 36, 
250 (1971)), which was 4,4-dimethoxytritylated at 5'-OH 
with 4,4-dimethoxytrityl chloride in pyridine and acetylated 
at 3'-OH with acetic anhydride in a one-pot reaction using 
standard reaction conditions. With 191 mg (0.71 mmole) 
2'-azido-2'-deoxyuridine as starting material, 396 mg (0.65 
mmol, 90.8%) 5 , -0-(4,4-dimethoxytrityl)-3 , -0-acetyl-2'- 
azido-2'-deoxuridine was obtained after purification via 
silica gel chromatography. Reduction of the azido group was 
performed using published conditions (Barta et al, Tetrahe- 
dron 46, 587-594 (1990)). Yield of 5'-0-(4,4- 
dimethoxytrityl)-3 , -0-acetyl-2'-amino-2'-deoxyuridine after 
silica gel chromatography was 288 mg (0.49 mmole; 76%). 
This protected 2'-amino-2'-deoxyuridine derivative (588 mg, 
1.0 mmole) was reacted with 2 equivalents (927 mg, 2.0 
mmole) N-Fmoc-glycine pentafluorophenyl ester in 10 ml 
dry DMF overnight at room temperature in the presence of 
1.0 mmole (174 fid) N,N-diisopropylethylamine. Solvents 
were removed by evaporation in vacuo and the residue 
purified by silica gel chromatography. Yield was 711 mg 
(0.71 mmole, 82%). Detritylation was achieved by a one 
hour treatment with 80% aqueous acetic acid at room 
temperature. The residue was evaporated to dryness, 
co-evaporated twice with toluene, suspended in 1 ml dry 
acetonitrile and 5'-phosphorylated with POCl 3 according to 
literature (Yoshikawa et al., Bull. Chem. Soc. Japan 42, 3505 
(1969) and Sowa et al., Bull. Chem. Soc. Japan 48, 2084 
(1975)) and directly transformed in a one-pot reaction to the 
5 '-triphosphate using 3 ml of a 0.5 M solution (1.5 mmole) 
tetra (tri-n-butylammonium) pyrophosphate in DMF accord- 
ing to literature (e.g. Seela et al, Helvetica ChimicaActa 74, 
1048 (1991)). The Fmoc and the 3'-0-acetyl groups were 
removed by a one-hour treatment with concentrated aqueous 
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ammonia at room temperature and the reaction mixture 
evaporated and lyophilized. Purification also followed stan- 
dard procedures by using anion-exchange chromatography 
on DEAE-Sephadex with a linear gradient of triethylammo- 
nium bicarbonate (0.1 M-1.0 M). Triphosphate containing 
fractions (checked by thin layer chromatography on poly- 
ethyleneimine cellulose plates) were collected, evaporated 
and lyophilized. Yield (by UV-absorbance of the uracil 
moiety) was 68% (0.48 mmole). 

A glycyl-glycine modified 2-^^0-2' -deoxyuridine-5'- 
triphosphate was obtained by removing the Fmoc group 
from S'-O-^^-dimethoxytrityl^'-O-acetyl^-N^N- 
9-fluorenylmethyloxycarbonyl-glycyl)-2 , -amino-2 f - 
deoxyuridine by a one-hour treatment with a 20% solution 
of piperidine in DMF at room temperature, evaporation of 
solvents, two-fold co-evaporation with toluene and subse- 
quent condensation with N-Fmoc-glycine pentafluorophenyl 
ester. Starting with 1.0 mmole of the 2 , -N-glycyl-2'-amino- 
2'-deoxyuridine derivative and following the procedure 
described above, 0.72 mmole (72%) of the corresponding 
2 , -(N-glycyl-glycyl)-2 , -amino-2 , -deoxyuridine-5'- 
triphosphate was obtained. 

Starting with S'-O^^-dimethoxytrity^-B'-O-acetyl^'- 
amino-2'-deoxyuridine and coupling with N-Fmoc-f5- 
alanine pentafluorophenyl ester, the corresponding 2'-(N-[3- 
alanyl)-2'-amino-2 1 -deoxyuridine-5'-triphosphate can be 
synthesized. These modified nucleoside triphosphates are 
incorporated during the Sanger DNA sequencing process in 
the primer-extension products. The mass difference between 
the glycine, (3 -alanine and glycyl-glycine mass-modified 
nucleosides is, per nucleotide incorporated, 58.06, 72.09 and 
115.1 daltons respectively. 

When starting with S'-O-^^-dimethoxytrityl^'-amino- 
2 l ,3'-dideoxythymidine (obtained by published procedures, 
see EXAMPLE 12), the corresponding 3'-(N-glycyl)-3'- 
amino-/3'-(-N-glycyl-glycyl)-3 , -amino-/and3 p -(N-p-alanyl)- 
3'-amino-2',3'-dideoxythymidine-5 , -triphosphates can be 
obtained. These mass-modified nucleoside triphosphates 
serve as a terminating nucleotide unit in the Sanger DNA 
sequencing reactions providing a mass difference per termi- 
nated fragment of 58.06, 72.09 and 115.1 daltons respec- 
tively when used in the multiplexing sequencing mode. The 
mass-differentiated fragments can then be analyzed by ES 
and/or MALDI mass spectrometry. 

EXAMPLE 15 
Synthesis of Deoxyuridine-S'-triphosphate Mass- modified at 
C-5 of the Heterocyclic Base with Glycine, Glycyl-glycine 
and p -alanine Residues 

0.281 g (1.0 mmole) 5-(3-Aminopropynyl-l)-2'- 
deoxyuridine (see EXAMPLE 6) was reacted with either 
0.927 g (2.0 mmole) N-Fmoc-glycine pentafiuoropheny- 
lester or 0.955g (2.0 mmole) N-Fmoc-P -alanine pentafluo- 
rophenyl ester in 5 ml dry DMF in the presence of 0.129 g 
N,N-diisopropylethylamine (174 ul, 1.0 mmole) overnight at 
room temperature. Solvents were removed by evaporation in 
vacuo and the condensation products purified by flash chro- 
matography on silica gel (Still et al., J. Org. Chem. 43, 
2923-2925 (1978)). Yields were 476 mg (0.85 mmole: 85%) 
for the glycine and 436 mg (0.76 mmole; 76%) for the 
[3-alanine derivatives. For the synthesis of the glycyl-glycine 
derivative, the Fmoc group of 1.0 mmole Fmoc-glycine- 
deoxyuridine derivative was removed by one-hour treatment 
with 20% piperidine in DMF at room temperature. Solvents 
were removed by evaporation in vacuo, the residue was 
co-evaporated twice with toluene and condensed with 0.927 
g (2.0 mmole) N-Fmoc-glycine pentafluorophenyl ester and 
purified as described above. Yield was 445 mg (0.72 mmole; 
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72%). The glycyl-, glycyl-glycyl- and p-alanyl-2'- 
deoxyuridine derivatives, N-protected with the Fmoc group 
were transformed to the 3'-0- acetyl derivatives by tritylation 
with 4,4-dimethoxytrityl chloride in pyridine and acetylation 

5 with acetic anhydride in pyridine in a one-pot reaction and 
subsequently detritylated by one hour treatment with 80% 
aqueous acetic acid according to standard procedures. Sol- 
vents were removed, the residues dissolved in 100 ml 
chloroform and extracted twice with 50 ml 10% sodium 
bicarbonate and once with 50 ml water, dried with sodium 
sulfate, the solvent evaporated and the residues purified by 
flash chromatography on silica gel. Yields were 361 mg 
(0.60 mmole; 71%) for the glycyl-, 351 mg (0.57 mmole; 
75%) for the (3-alanyl- and 323 mg (0.49 mmole; 68%) for 

i5 the glycyl-glycyl-S-O'-acetyl^'-deoxyuridine derivatives 
respectively. Phosphorylation at the 5'-OH with POCl 3 , 
transformation into the 5'-triphosphate by in-situ reaction 
with tetra(tri-n-butylammonium) pyrophosphate in DMF, 
3-de-O-acetylation, cleavage of the Fmoc group, and final 

2Q purification by anion-exchange chromatography on DEAE- 
Sephadex was performed as described in EXAMPLE 14. 
Yields according to UV-absorbance of the uracil moiety 
were 0.41 mmole 5-(3-(N-glycyl)-amidopropynyl-l)-2'- 
deoxyuridine-5'-triphosphate (84%), 0.43 mmole 5-(3-(N-(3- 
alanyl)-amidopropynyl-l)-2'-deoxyuridine-5' -triphosphate 

25 (75%) and 0.38 mmole 5-(3-(N-glycyl-glycyl)- 
amidopropynyl-l)-2 , -deoxyuridine-5 l -triphosphate (78%). 

These mass-modified nucleoside triphosphates were 
incorporated during the Sanger DNA sequencing primer- 

3o extension reactions. 

When using 5-(3-aminopropynyl-l)-2',3'-dideoxyuridine 
as starting material and following an analogous reaction 
sequence the corresponding glycyl-, glycyl-glycyl- and 
P-alany 1-2', 3'-dideoxyuridine-5'- triphosphates were 

35 obtained in yields of 69, 63 and 71% respectively. These 
mass-modified nucleoside triphosphates serve as chain- 
terminating nucleotides during the Sanger DNA sequencing 
reactions. The mass-modified sequencing ladders are ana- 
lyzed by either ES or MALDI mass spectrometry. 

40 EXAMPLE 16 

Synthesis of 8-glycyl- and S-glycyl-glycyl-2'- 
deoxy adenosine-5 1 -triphosphate 

727 mg (1.0 mmole) of N s -(4-tert-butylphenoxyacetyl)- 
8 -glycyl -5'-(4,4-dimethoxytrityl)-2'-deoxy adenosine or 800 

45 mg (1.0 mmole) N 6 -(4-tert-butylphenoxyacetyl)-8-glycyl- 
glycyl-5'-(4,4-dimethoxytrityl)-2 , -deoxyadenosine prepared 
according to EXAMPLES 10 and 11 and literature (Koster 
et al., Tetrahedron 37, 362 (1981)) were acetylated with 
acetic anhydride in pyridine at the 3'-OH, detritylated at the 

50 5'-position with 80% acetic acid in a one-pot reaction and 
transformed into the 5'-triphosphates via phosphorylation 
with POCl 3 and reaction in-situ with tetra(tri-n- 
butylammonium)pyrophosphate as described in EXAMPLE 
14. Deprotection of the N 6 -tert-butylphenoxy acetyl, the 

55 3-O-acetyl and the O-methyl group at the glycine residues 
was achieved with concentrated aqueous ammonia for 
ninety minutes at room temperature. Ammonia was removed 
by lyophilization and the residue washed with 
dichloromethane, solvent removed by evaporation in vacuo 

60 and the remaining solid material purified by anion-exchange 
chromatography on DEAE-Sephadex using a linear gradient 
of triethylammonium bicarbonate from 0.1 to 1.0 M. The 
nucleoside triphosphate containing fractions (checked by 
TLC on polyethyleneimine cellulose plates) were combined 

65 and lyophillized. Yield of the 8-glycyl-2'-deoxyadenosine- 
5' -triphosphate (determined by UV-absorbance of the 
adenine moiety) was 57% (0.57 mmole). The yield for the 
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S-glycyl-glycyl^'-deoxyadenosine-S'-triphosphate was 
51% (0.51 mmole). 

These mass-modified nucleoside triphosphates were 
incorporated during primer-extension in the Sanger DNA 
sequencing reactions. 5 

When using the corresponding N6-(4-tert- 
butylphenoxyacetyl)-8-glycyl- or -glycyl-glycly-5'-0-(4,4- 
dimethoxytrityl^^S'-dideoxyadenosine derivatives as start- 
ing materials prepared according to standard procedures 
(see, e.g., for the introduction of the 2',3 '-function: Seela et ID 
aL, Helvetica Chimica Acta 74, 1048-1058 (1991)) and 
using an analogous reaction sequence as described above, 
the chain-terminating mass-modified nucleoside triphos- 
phates 8-glycyl- and S-glycyl-glycyl-2',3'- 
dideoxyadenosine-5 T -triphosphates were obtained in 53 and 15 
47% yields respectively. The mass-modified sequencing 
fragment ladders are analyzed by either ES or MALDI mass 
spectrometry. 

EXAMPLE 17 2Q 
Mass-modification of Sanger DNA Sequencing Fragment 
Ladders by Incorporation of Chain-elongating 2'-deoxy- and 
Chain-terminating 2',3'-dideoxythymidine-5 , -(alpha-S-)- 
triphosphate and Subsequent Alkylation with 2-iodoethanol 
and 3-iodopropanol 

2',3'-Dideoxythymidine-5 , -(alpha-S)-triphosphate was 
prepared according to published procedures (e.g., for the 
alpha-S- triphosphate moiety: Eckstein et aL, Biochemistry 
15, 1685 (1976) and Accounts Chem. Res. 12, 204 (1978) 
and for the 2',3'-dideoxy moiety: Seela et aL, Helvetica 
Chimica Acta, 74, 1048-1058 (1991)). Sanger DNA 30 
sequencing reactions employing 2'-deoxythymidine-5'- 
(alpha-S)-triphosphate are performed according to standard 
protocols (e.g. Eckstein, Ann. Rev. Biochem. 54, 367 
(1985)). When using 2 r ,3 , -dideoxythymidine-5 , -(alpha-S)- 35 
triphosphates, this is used instead of the unmodified 2',3'- 
dideoxythymidine-5'-triphosphate in standard Sanger DNA 
sequencing (see e.g. Swerdlow et aL, Nucleic Acids Res. 18, 
1415-1419 (1990)). The template (2 pmole) and the nucleic 
acid M13 sequencing primer (4 pmole) modified according 
to EXAMPLE 1 are annealed by heating to 65° C. in 100 ul 40 
of 10 mM Tris-HCl pH 7.5, 10 mM MgCl 2 , 50 mM NaCl, 
7 mM dithiothreitol (DTT) for 5 min and slowly brought to 
37° C. during a one hour period. The sequencing reaction 
mixtures contain, as exemplified for the T-specific termina- ^ 
tion reaction, in a final volume of 150 ul, 200 uM (final 
concentration) each of dATP, dCTP, dTTP, 300 uM 
c7-deaza-dGTP, 5 uM 2',3'-dideoxythymidine-5 , -(alpha-S)- 
triphosphate and 40 units Sequenase (United States 
Biochemicals). Polymerization is performed for 10 min at ^ 
37° C, the reaction mixture heated to 70° C. to inactivate the 
Sequenase, ethanol precipitated and coupled to thiolated 
Sequelon membrane disks (8 mm diameter) as described in 
EXAMPLE 1. Alkylation is performed by treating the disks 
with 10 ul of 10 mM solution of either 2-iodoethanol or 
3-iodopropanol in NMM (N-methylmorpholine/water/2- 
propanol, 2/49/49, v/v/v) (three times), washing with 10 ul 
NMM (three times) and cleaving the alkylated T-terminated 
primer-extension products off the support by treatment with 
DTT as described in EXAMPLE 1. Analysis of the mass- gQ 
modified fragment families is performed with either ES or 
MALDI mass spectrometry. 

EXAMPLE 18 
Analysis of a Mixture of Oligothymidylic Acids 

Oligothymidylic acid, oligo p(dT) 12 _ 18 , is commercially 65 
available (United States Biochemical, Cleveland, Ohio). 
Generally, a matrix solution of 0.5 M in ethanol was 


prepared. Various matrices were used for this Example and 
Examples 19-21 such as 3,5-dihydroxybenzoic acid, sina- 
pinic acid, 3 -hy droxypicolinic acid, 2,4,6- 
trihydroxyacetophenone. Oligonucleotides were lyophilized 
after purification by HPLC and taken up in ultrapure water 
(MilliQ, Millipore) using amounts to obtain a concentration 
of 10 pmoles/wl as stock solution. An aliquot (1 jul) of this 
concentration or a dilution in ultrapure water was mixed 
with 1 fA of the matrix solution on a flat metal surface 
serving as the probe tip and dried with a fan using cold air. 
In some experiments, cation-ion exchange beads in the acid 
form were added to the mixture of matrix and sample 
solution. 

MALDI-TOF spectra were obtained for this Example and 
Examples 19-21 on different commercial instruments such 
as Vision 2000 (Finnigan-MAT), VG TofSpec (Fisons 
Instruments), LaserTec Research (Vestec). The conditions 
for this Example were linear negative ion mode with an 
acceleration voltage of 25 kV The MALDI-TOF spectrum 
generated is shown in FIG. 14. Mass calibration was done 
externally and generally achieved by using defined peptides 
of appropriate mass range such as insulin, gramicidin S, 
trypsinogen, bovine serum albumen, and cytochrome C. All 
spectra were generated by employing a nitrogen laser with 
5 nsec pulses at a wavelength of 337 nm. Laser energy 
varied between 10 6 and 10 7 W/cm 2 . To improve signal-to- 
noise ratio generally, the intensities of 10 to 30 laser shots 
were accumulated. 

EXAMPLE 19 
Mass Spectrometric Analysis of a 50-mer and a 99-mer 

Two large oligonucleotides were analyzed by mass spec- 
trometry. The 50-mer d (TAA- 
C GGTC ATTAC GGCCATTGACTGTAGG ACCTGC ATT- 
ACATGACTAGCT) (SEQ ID NO:3) and dT(pdT) 99 were 
used. The oligodeoxynucleotides were synthesized using 
[3-cyanoethylphosphoamidites and purified using published 
procedures.(e.g. N. D. Sinha, J. Biernat, J. McManus and H. 
Koster, Nucleic Acids Res., 12, 4539 (1984)) employing 
commercially available DNA synthesizers from either Mil- 
lipore (Bedford, Mass.) or Applied Biosystems (Foster City, 
Calif.) and HPLC equipment and RP18 reverse phase col- 
umns from Waters (Milford, Mass.). The samples for mass 
spectrometric analysis were prepared as described in 
Example 18. The conditions used for MALDI-MS analysis 
of each oligonucleotide were 500 fmol of each 
oligonucleotide, reflectron positive ion mode with an accel- 
eration of 5 kV and postacceleration of 20 kV. The MALDI- 
TOF spectra generated were superimposed and are shown in 
FIG. 15. 

EXAMPLE 20 
Simulation of the DNA Sequencing Results of FIG. 2 

The 13 DNA sequences representing the nested 
dT- terminated fragments of the Sanger DNA sequencing for 
the 50-mer described in Example 19 (SEQ ID NO:3) were 
synthesized as described in Example 19. The samples were 
treated and 500 fmol of each fragment was analyzed by 
MALDI-MS as described in Example 18. The resulting 
MALDI-TOF spectra are shown in FIG. 16. The conditions 
were reflectron positive ion mode with an acceleration of 5 
kV and postacceleration of 20 kV. Calculated molecular 
masses and experimental molecular masses are shown in 
Table 1. 

The MALDI-TOF spectra were superimposed (FIG. 17) 
to demonstrate that the individual peaks are resolvable even 
between the 10-mer and 11-mer (upper panel) and the 
37-mer and 38-mer (lower panel). The two panels show two 
different scales and the spectra analyzed at that scale. 
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EXAMPLE 21 
MALDI-MS Analysis of a Mass-Modified Oligonucleotide 
A 17-mer was mass-modified at C-5 of one or two 
deoxyuridine moieties. 5-[13-(2-Methoxyethoxyl)- 
tridecyne-l-yl]-5'-0-(4,4'-dimethoxytrityl)-2'-deoxyuridme- 
3'-P-cyanoethyl-N,N-diisopropylphosphoamidite was used 
to synthesize the modified 17-mers using the methods 
described in Example 19. 

The modified 17-mers were 

(SEQ ID NO:4) 

X 
I 

a: d (TAAAACGACGGCCAGUG) (molecular mass: 5454) 

(SEQ ID NO:5) 

X X 

b: d (UAAAACGACGGCCAGUG) (molecular mass: 5634) 

where X = C^C (CH 2 )n — OH 

(unmodified 17-mer: molecular mass: 5273) 

The samples were prepared and 500 fmol of each modi- 
fied 17-mer was analyzed using MALDI-MS as described in 
Example 18. The conditions used were reflectron positive 
ion mode with an acceleration of 5 kV and postacceleration 
of 20 kV. The MALDI-TOF spectra which were generated 
were superimposed and are shown in FIG. 18. 

EXAMPLE 22 
Detection of Polymerase Chain Reaction Products Contain- 
ing 7-Deazapurine 

Materials and Methods 
PCR Amplifications 

The following oligodeoxynucleotide primers were either 
synthesized according to standard phosphoamidite chemis- 
try (Sinha, N. D,. et al., (1983) Tetrahedron Let. Vol. 24, Pp. 
5843-5846; Sinha, N. D., et al., (1984) Nucleic Acids Res., 
Vol. 12, Pp. 4539-4557) on a MilliGen 7500 DNA synthe- 
sizer (Millipore, Bedford, Mass., USA) in 200 nmol scales 
or purchased from MWG-Biotech (Ebersberg, Germany, 
primer 3) and Biometra (Goettingen, Germany, primers 
6-7). 

primer 1: 5'-GTCACCCTCGACCTGCAG (SEQ. ID. NO. 

6) ; 

primer 2: 5'-TTGTAAAACGACGGCCAGT (SEQ. ID. NO. 

7) ; 

primer 3: 5 CTTCC ACCGCGATGTTG A (SEQ. ID. NO. 

8) ; 

primer 4: 5 ' - C AG G AAAC AG CTATG AC (SEQ. ID. NO. 9); 
primer 5: 5 '- GTAAAACGACGGCC AGT (SEQ. ID. NO. 
10); 

primer 6: 5'-GTCACCCTCGACCTGCAgC (g: RiboG) 

(SEQ. ID. NO. 11); 
primer 7: 5-GTTGTAAAACGAGGGCCAgT (g: RiboG) 

(SEQ. ID. NO. 12); 

The 99-mer and 200-mer DNA strands (modified and 
unmodified) as well as the ribo- and 7-deaza-modified 
100-mer were amplified from pRFcl DNA (10 ng, gener- 
ously supplied S. Feyerabend, University of Hamburg) in 
100 fiL reaction volume containing 10 mmol/L KC1, 10 
mmol/L (NH 4 ) 2 S0 4 , 20 mmol/L Tris HC1 (pH=8.8), 2 
mmol/L MgS0 4 , (exo(-yPseudococcus furiosus (Pfu) 
-Buffer, Pharmacia, Freiburg, Germany), 0.2 mmol/L each 
dNTP (Pharmacia, Freiburg, Germany), 1 ^mol/L of each 
primer and 1 unit of exo(-)Pfu DNA polymerase 
(Stratagene, Heidelberg, Germany). 
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For the 99-mer primers 1 and 2, for the 200-mer primers 
1 and 3 and for the 100-mer primers 6 and 7 were used. To 
obtain 7-deazapurine modified nucleic acids, during PCR- 
amplification dATP and dGTP were replaced with 7-deaza- 

5 dATP and 7-deaza-dGTP. The reaction was performed in a 
thermal cycler (OmniGene, MWG-Biotech, Ebersberg, 
Germany) using the cycle: denaturation at 95° C. for 1 min., 
annealing at 51° C. for 1 min. and extension at 72° C. for 1 
min. For all PCRs the number of reaction cycles was 30. The 

10 reaction was allowed to extend for additional 10 min. at 72° 
C. after the last cycle. 

The 103-mer DNA strands (modified and unmodified) 
were amplified from M13mpl8 RFI DNA (100 ng, 
Pharmacia, Freiburg, Germany) in 100 j«L reaction volume 

15 using primers 4 and 5 all other concentrations were 
unchanged. The reaction was performed using the cycle: 
denaturation at 95° C. for 1 min., annealing at 40° C. for 1 
min. and extension at 72° C. for 1 min. After 30 cycles for 
the unmodified and 40 cycles for the modified 103-mer 

20 respectively, the samples were incubated for additional 10 
min. at 72° C. 

Synthesis of 5'-[ 32 -P]-labeled PCR-primers 

Primers 1 and 4 were 5'-[ 32 -P] -labeled employing 

25 T4-polynucleotidkinase (Epicentre Technologies) and 
( Y - 32 P)-ATP. (BLU/JNGG/502A, Dupont, Germany) 
according to the protocols of the manufacturer. The reactions 
were performed substituting 10% of primer 1 and 4 in PCR 
with the labeled primers under otherwise unchanged 

30 reaction-conditions. The amplified DNAs were separated by 
gel electrophoresis on a 10% polyacrylamide gel. The appro- 
priate bands were excised and counted on a Packard TRI- 
CARB 460C liquid scintillation system (Packard, Conn., 
USA). 

35 Primer-cleavage from Ribo -modified PCR-product 

The amplified DNA was purified using Ultrafree-MC 

filter units (30,000 NMWL), it was then redissolved in 100 

{d of 0.2 mol/L NaOH and heated at 95° C. for 25 minutes. 

The solution was then acidified with HC1 (1 mol/L) and 
40 further purified for MALDI-TOF analysis employing 

Ultrafree-MC filter units (10,000 NMWL) as described 

below. 

Purification of PCR Products 

45 All samples were purified and concentrated using 
Ultrafree-MC units 30000 NMWL (Millipore, Eschborn, 
Germany) according to the manufacturer's description. After 
lyophilisation, PCR products were redissolved in 5 //L (3 fiL 
for the 200-mer) of ultrapure water. This analyte solution 

5Q was directly used for MALDI-TOF measurements. 
MALDI-TOF MS 

Aliquots of 0.5 ,ttL of analyte solution and 0.5 //L of matrix 
solution (0.7 mol/L 3 -HPA and 0.07 mol/L ammonium 
citrate in acetonitrile/water (1:1, v/v)) were mixed on a fiat 

55 metallic sample support. After drying at ambient tempera- 
ture the sample was introduced into the mass spectrometer 
for analysis. The MALDI-TOF mass spectrometer used was 
a Finnigan MAT Vision 2000 (Finnigan MAT, Bremen, 
Germany). Spectra were recorded in the positive ion reflec- 

60 tor mode with a 5 keV ion source and 20 keV postaccel- 
eration. The instrument was equipped with a nitrogen laser 
(337 nm wavelength). The vacuum of the system was 
3^-TO -8 hPa in the analyzer region and l^-40 -7 hPa in the 
source region. Spectra of modified and unmodified DNA 

65 samples were obtained with the same relative laser power; 
external calibration was performed with a mixture of syn- 
thetic oligodeoxy nucleotides (7- to 50-mer). 
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Results and Discussion 
Enzymatic Synthesis of 7-deazapurine Nucleotide Contain- 
ing Nucleic Acids by PCR 

In order to demonstrate the feasibility of MALDI-TOF 
MS for the rapid, gel-free analysis of short PCR products 
and to investigate the effect of 7-deazapurine modification of 
nucleic acids under MALDI-TOF conditions, two different 
primer-template systems were used to synthesize DNA 
fragments. Sequences are displayed in FIGS. 24 and 25. 
While the two single strands of the 103 -mer PCR product 
had nearly equal masses (Am=8 u), the two single strands of 
the 99-mer differed by 526 u. 

Considering the facts that 7-deaza purine nucleotide 
building blocks for chemical DNA synthesis are approxi- 
mately 160 times more expensive than regular ones (Product 
Information, Glen Research Corporation, Sterling, Va.) and 
their application in standard |3-cyano-phosphoamidite chem- 
istry is not trivial (Product Information, Glen Research 
Corporation, Sterling, Va.; Schneider, K and B. T. Chait 
(1995) Nucleic Acids Res. 23, 1570) the cost of 7-deaza 
purine modified primers would be very high. Therefore, to 
increase the applicability and scope of the method, all PCRs 
were performed using unmodified oligonucleotide primers 
which are routinely available. Substituting dATP and dGTP 
by c 7 -dATP and c 7 -dGTP in polymerase chain reaction led 
to products containing approximately 80% 7-deaza-purine 
modified nucleosides for the 99-mer and 103-mer; and about 
90% for the 200-mer, respectively. Table II shows the base 
composition of all PCR products. 
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shows lower yields for the modified PCR-products. The 
bands were excised from the gel and counted. For all PCR 
products the yield of the modified nucleic acids was about 
50%, referring to the corresponding unmodified amphfica- 

5 tion product. Further experiments showed that exo(-) 
Deep Vent and Vent DNA polymerase were able to incorpo- 
rate c 7 -dATP and c 7 -dGTP during PCR as well The overall 
performance, however, turned out to be best for the exo(-) 
Pfu DNA polymerase giving least side products during 
amplification. Using all three polymerases, it was found that 
such PCRs employing c 7 -dATP and c 7 -dGTP instead of their 
isosteres showed less side -reactions giving a cleaner PCR- 
product. Decreased occurrence of amplification side prod- 
ucts may be explained by a reduction of primer mismatches 
due to a lower stability of the complex formed from the 

15 primer and the 7-deaza-purine containing template which is 
synthesized during PCR. Decreased melting point for DNA 
duplexes containing 7-deaza-purine have been described 
(Mizusawa, S. et al., (1986) Nucleic Acids Res., 14, 
1319-1324). In addition to the three polymerases specified 

20 above (exo(-) Deep Vent DNA polymerase, Vent DNA 
polymerase and exo(-) (Pfu) DNA polymerase), it is antici- 
pated that other polymerases, such as the Large Klenow 
fragment of E. coli DNA polymerase, Sequenase, Taq DNA 
polymerase, and U AmpliTaq, AmpliTaq or AmpliTaq TS 

25 DNApolymerase can be used. In addition, where RNAis the 
template, RNA polymerases, such as the SP6 or the T7 RNA 
polymerase, must be used 

MALDI-TOF Mass Spectrometry of Modified and Unmodi- 
fied PCR Products 


TABLE II 


Base composition of the 99-mer, 103-mer and 200-mer PCR amplification products 
(unmodified and 7-deaza purine modified) 

DNA-fragments 1 C T A G c 7 -deaza-A c T -deaza-G rel. modification 2 


200-mers 

54 

34 

56 

56 




modified 200-mer s 

54 

34 

6 

5 

50 

51 

90% 

200-mer a 

56 

56 

34 

54 




modified 200-mer a 

56 

56 

3 

4 

31 

50 

92% 

103-mer s 

28 

23 

24 

25 




modified 103-mer s 

28 

23 

6 

5 

18 

23 

79% 

103-mer a 

28 

24 

23 

28 




modified 103-mer a 

28 

24 

7 

4 

16 

24 

78% 

99-mer s 

34 

21 

24 

20 




modified 99-mer s 

34 

21 

6 

5 

18 

15 

75% 

99-mer a 

20 

24 

21 

34 




modified 99-mer a 

20 

24 

3 

4 

18 

30 

87% 


U V and "a" describe "sense" and "antisense" strands of the double-strande< 
2 indicates relative modification as percentage of 7-deaza purine modified nu 
amount of purine nucleotides. 

However, it remained to be determined whether 80-90% 
7-deaza-purine modification would be sufficient for accurate 
mass spectrometer detection. It was therefore important to 
determine whether all purine nucleotides could be substi- 
tuted during the enzymatic amplification step. It was found 
that exo(-)Pseudococcusfuriosus (Pfu) DNA polymerase 
indeed could accept c 7 -dATP and c 7 -dGTP in the absence of 
unmodified purine triphosphates. However, the incorpora- 
tion was less efficient leading to a lower yield of PCR 
product (FIG. 26). Ethidium-bromide stains by intercalation 
with the stacked bases of the DNA-doublestrand. Therefore 
lower band intensities in the ethidium-bromide stained gel 
might be artifacts since the modified DNA-strands do not 
necessarily need to give the same band intensities as the 
unmodified ones. 

To verify these results, the PCRs with [ 32 P] -labeled 
primers were repeated. The autoradiogram (FIG. 27) clearly 


PCR product, 
deotides of total 

50 

The 99-mer, 103-mer and 200-mer PCR products were 
analyzed by MALDI-TOF MS. Based on past experience, it 
was known that the degree of depurination depends on the 
laser energy used for desorption and ionization of the 

55 analyte. Since the influence of 7-deazapurine modification 
on fragmentation due to depurination was to be investigated, 
all spectra were measured at the same relative laser energy. 

FIGS. 28a and 28b show the mass spectra of the modified 
and unmodified 103-mer nucleic acids. In case of the modi- 

60 fled 103-mer, fragmentation causes a broad (M+H)""" signal. 
The maximum of the peak is shifted to lower masses so that 
the assigned mass represents a mean value of (M+H) + signal 
and signals of fragmented ions, rather than the (M+H)* 
signal itself. Although the modified 103-mer still contains 

65 about 20% A and G from the oligonucleotide primers, it 
shows less fragmentation which is featured by much more 
narrow and symmetric signals. Especially peak tailing on the 
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lower mass side due to depurination, is substantially 
reduced. Hence, the difference between measured and cal- 
culated mass is strongly reduced although it is still below the 
expected mass. For the unmodified sample a (M+H) + signal 
of 31670 was observed, which is a 97 u or 0.3% difference 
to the calculated mass. While, in case of the modified sample 
this mass difference diminished to 10 u or 0.03% (31713 u 
found, 31723 u calculated). These observations are verified 
by a significant increase in mass resolution of the (M+H) + 
signal of the two signal strands (m/Am=67 as opposed to 18 
for the unmodified sample with Am=full width at half 
maximum, fwhm). Because of the low mass difference 
between the two single strands (8 u) their individual signals 
were not resolved. 

With the results of the 99 base pair DNA fragments the 
effects of increased mass resolution for 7-deazapurine con- 
taining DNA becomes even more evident. The two single 
strands in the unmodified sample were not resolved even 
though the mass difference between the two strands of the 
PCR product was very high with 526 u due to unequal 
distribution of purines and pyrimidines (FIG. 29a). In con- 
trast to this, the modified DNA showed distinct peaks for the 
two single strands (FIG. 29b) which makes the superiority of 
this approach for the determination of molecular weights to 
gel electrophoretic methods even more profound. Although 
base line resolution was not obtained the individual masses 
were abled to be assigned with an accuracy of 0.1%:Am=27 
u for the lighter (calc. mass=30224 u) and Am=14 u for the 
heavier strand (calc. mass=30750 u). Again, it was found 
that the full width at half maximum was substantially 
decreased for the 7-deazapurine containing sample. 

In case of both the 99-mer and 103-mer the 7-deazapurine 
containing nucleic acids seem to give higher sensitivity 
despite the fact that they still contain about 20% unmodified 
purine nucleotides. To get comparable signal-to-noise ratio 
at similar intensities for the (M+H) + signals, the unmodified 
99-mer required 20 laser shots in contrast to 12 for the 
modified one and the 103-mer required 12 shots for the 
unmodified sample as opposed to three for the 7-deazapurine 
nucleoside-containing PCR product. 

Comparing the spectra of the modified and unmodified 
200-mer amplicons, improved mass resolution was again 
found for the 7-deazapurine containing sample as well as 
increased signal intensities (FIGS. 30a and 30b). While the 
signal of the single strands predominates in the spectrum of 
the modified sample the DNA-suplex and dimers of the 
single strands gave the strongest signal for the unmodified 
sample. 

A complete 7-deaza purine modification of nucleic acids 
may be achieved either using modified primers in PCR or 
cleaving the unmodified primers from the partially modified 
PCR product. Since disadvantages are associated with modi- 
fied primers, as described above, a 100-mer was synthesized 
using primers with a ribo-modification. The primers were 
cleaved hydrolytically with NaOH according to a method 
developed earlier in our laboratory (Koster, H. et al, Z. 
Physiol. Chem. 359:1570-1589). FIGS. 31a and 316 display 
the spectra of the PCR product before and after primer 
cleavage. FIG. 31b shows that the hydrolysis was success- 
ful: Both hydrolyzed PCR product as well as the two 
released primers could be detected together with a small 
signal from residual uncleaved 100-mer. This procedure is 
especially useful for the MALDI-TOF analysis of very short 
PCR-products since the share of unmodified purines origi- 
nating from the primer increases with decreasing length of 
the amplified sequence. 

The remarkable properties of 7-deazapurine modified 
nucleic acids can be explained by either more effective 
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desorption and/or ionization, increased ion stability and/or a 
lower denaturation energy of the double stranded purine 
modified nucleic acid. The exchange of the N-7 for a 
methine group results in the loss of one acceptor for a 

5 hydrogen bond which influences the ability of the nucleic 
acid to form secondary structures due to non-Watson-Crick 
base pairing (Seela, F. and A. Kehne (1987) Biochemistry, 
26, 2232-2238.), which should be a reason for better des- 
orption during the MALDI process. In addition to this the 

10 aromatic system of 7-deazapurine has a lower electron 
density that weakens Watson-Crick base pairing resulting in 
a decreased melting point (Mizusawa, S. et al., (1986) 
Nucleic Acids Res., 14, 1319-1324) of the double-strand. 
This effect may decrease the energy needed for denaturation 

15 of the duplex in the MALDI process. These aspects as well 
as the loss of a site which probably will carry a positive 
charge on the N-7 nitrogen renders the 7-deazapurine modi- 
fied nucleic acid less polar and may promote the effective- 
ness of desorption. 

20 Because of the absence of N-7 as proton acceptor and the 
decreased polarizaiton of the C — N bond in 7-deazapurine 
nucleosides depurination following the mechanisms estab- 
lished for hydrolysis in solution is prevented. Although a 
direct correlation of reactions in solution and in the gas 

25 phase is problematic, less fragmentation due to depurination 
of the modified nucleic acids can be expected in the MALDI 
process. Depurination may either be accompanied by loss of 
charge which decreases the total yield of charged species or 
it may produce charged fragmentation products which 

30 decreases the intensity of the non fragmented molecular ion 
signal. 

The observation of both increased sensitivity and 
decreased peak tailing of the (M+H) + signals on the lower 
mass side due to decreased fragmentation of the 

35 7-deazapurine containing samples indicate that the N-7 atom 
indeed is essential for the mechanism of depurination in the 
MALDI-TOF process. In conclusion, 7-deazapurine con- 
taining nucleic acids show distinctly increased ion-stability 
and sensitivity under MALDI-TOF conditions and therefore 

40 provide for higher mass accuracy and mass resolution. 

EXAMPLE 23 
Solid State Sequencing and Mass Spectrometer Detection 

45 Materials and Methods 

Oligonucleotides were purchased from Operon Technolo- 
gies (Alameda, Calif.) in an unpurified form. Their 
sequences are listed in Table III. Sequencing reactions were 
performed on a solid surface using reagents from the 
50 sequencing kit for Sequenase Version 2.0 (Amersham, 
Arlington Heights, 111.). 


55 Sequencing a 39-mer target 

Sequencing complex: 
5'- 

TCTGGCCTGGTGC AGGGCCTTATTGTAGTTGTGACGTACA- (A* 7 ) a - 3 ' 
(DNA11683) (SEQ ID. No. 13) 

3 TCAAC ACTGCATGT- 5 ' (PNA16/DNA) 
(SEQ. ID. No. 14) 


In order to perform solid-state DNA sequencing, template 
strand DNA11683 was 3'-biotinylated by terminal deoxy- 
65 nucleotidyl transferase. A 30 jul reaction, containing 60 pmol 
of DNA11683, 1.3 nmol of biotin 14-dATP (GIBCO BRL, 
Grand Island, N.Y.), 30 units of terminal transferase 
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42 


(Amersham, Arlington Heights, 111.), and lx reaction buffer 
(supplied with enzyme), was incubated at 37° C. for 1 hour. 
The reaction was stopped by heat inactivation of the termi- 
nal transferase at 70° C. for 10 min. The resulting product 
was desalted by passing through a TE-10 spin column 5 
(Clone tech). More than one molecules of biotin-14-dATP 
could be added to the 3'-end of DNA11683. The biotinylated 
DNA11683 was incubated with 0.3 mg of Dynal streptavidin 
beads in 30 fA lx binding and washing buffer at ambient 
temperature for 30 min. The beads were washed twice with 10 
TE and redissolved in 30 fA TE, 10 jul aliquot (containing 0.1 
mg of beads) was used for sequencing reactions. 

The 0.1 mg beads from previous step were resuspended in 
a 10 fA volume containing 2 (A of 5x Sequenase buffer (200 
mM Tris-HCl, pH 7.5, 100 mM MgC12, and 250 mM NaCl) 15 
from the Sequenase kit and 5 pmol of corresponding primer 
FN A 1 6/DNA. The annealing mixture was heated to 70° C. 
and allowed to cool slowly to room temperature over a 
20-30 min time period. Then 1 fA 0.1 M dithiothreitol 2 o 
solution, 1 fA Mn buffer (0.15 M sodium isocitrate and 0.1 
M McC12), and 2 fA of diluted Sequenase (3.25 units) were 
added. The reaction mixture was divided into four aliquots 
of 3 fA each and mixed with termination mixes (each consists 
of 3 fA of the appropriate termination mix: 32 fAS/L c7dATP, 25 
32 uM dCTP, 32 //M c7dGTP, 32 //M dTTP and 3.2 uM of 
one of the four ddTNPs, in 50 mM NaCl). The reaction 
mixtures were incubated at 37° C. for 2 min. After the 
completion of extension, the beads were precipitated and the 30 
supernatant was removed. The beads were washed twice and 
resuspended in TE and kept at 4° C 


volume containing 2 fA of 5x Sequenase buffer (200 mM 
Tris-HCl, pH 7.5, 100 mM MgCll, and 250 mM NaCl) from 
the Sequenase kit. The annealing mixture was heated to 65° 
C. and allowed to cool slowly to 37° C. over a 20—30 min 
time period. The duplex primer was then mixed with 10 
pmol of TSlo (20 pmol of TS10 for 0.2 mg of beads) in 1 
fi\ volume, and the resulting mixture was further incubated 
at 37° C. for 5 min, room temperature for 5-10 min. Then 

1 fA 0.1 M dithiothreitol solution, 1 fA Mn buffer (0.15 M 
sodium isocitrate and 0.1 M MnCl 2 ), and 2 fA of diluted 
Sequenase (3.25 units) were added. The reaction mixture 
was divided into four aliquots of 3 fA each and mixed with 
termination mixes (each consists of 4 {A of the appropriate 
termination mix: 16 fAS/L dATP, 16 /dM dCTP, 16 uM dGTP, 
16 juM dTTP and 1.6 ^M of one of the four ddNTPs, in 50 
mM NaCl). The reaction mixtures were incubated at room 
temperature for 5 min, and 37° C. for 5 min. After the 
completion of extension, the beads were precipitated and the 
supernatant was removed. The beads were resuspended in 20 
/A TE and kept at 4° C. An aliquot of 2 jul (out of 20 jul) from 
each tube was taken and mixed with 8 fA of formamide, the 
resulting samples were denatured at 90—95° C. for 5 min and 

2 fA (out of 10 fA total) was applied to an ALF DNA 
sequencer (Pharmacia, Piscataway, N.J.) using a 10% poly- 
acrylamide gel containing 7 M urea and 0.6x THE. The 
remaining aliquot was used for MALDI-TOFMS analysis. 
MALDI Sample Preparation and Instrumentation 

Before MALDI analysis, the sequencing ladder loaded 
magnetic beads were washed twice using 50 mM ammo- 
nium citrate and resuspended in 0.5 fA pure water. The 


Sequencing a 78-mer target 


Sequencing complex: 

5 '-AAGATCTGACCAGOGAITCCH^TTAGCCTTGACTGCTCKTrGCTGCTGCTGCTC^ 

TGGATGATCCGACGCATCAGATCTGG- (A b ) n -3 (SEQ. ID. NO. 15) 
(TNR. PLASM2) 

3 '- CTACTAGGCTGCGTAGTC- 5 1 (CM1) (SEQ. ID. NO. 16) 


The target TNR.PLASM2 was biotinylated and 
sequenced using procedures similar to those described in 
previous section (sequencing a 39-mer target). 


Sequencing a 15-mer target with partially duplex probe 
Sequencing complex: 

5 - F-G ATGATCCG ACGCATC ACAGCTC 3 (SEQ. ID. No. 17) 
3> -b- 

CTACTAGGCTGCGTAGTGTCGAGAACCTTGGCT 3 ' 
(SEQ. ID. No. 18) 


CM1B3B was immobilized on Dynabeads M280 with 
streptavidin (Dynal, Norway) by incubating 60 pmol of 
CM1B3B with 0.3 magnetic beads in 30 jul 1 M NaCl and 
TE (lx binding and washing buffer) at room temperature for 
30 min. The beads were washed twice with TE and redis- 
solved in 30 jul TE, 10 or 20 fA aliquot (containing 0.1 or 0.2 
mg of beads respectively) was used for sequencing reac- 
tions. 

The duplex was formed by annealing corresponding ali- 
quot of beads from previous step with 10 pmol of DFlla5F 
(or 20 pmol of DFlla5F for 0.2 mg of beads) in a 9 jul 


suspension was then loaded onto the sample target of the 
45 mass spectrometer and 0.5 fA of saturated matrix solution 
(3-hydropicolinic acid (HPA): ammonium citrate=10:l mole 
ratio in 50% acetonitrile) was added. The mixture was 
allowed to dry prior to mass spectometer analysis. 

The reflectron TOFMS mass spectrometer (Vision 2000, 
50 Finnigan MAT, Bremen, Germany) was used for analysis. 5 
kV was applied in the ion source and 20 kV was applied for 
postacceleration. All spectra were taken in the positive ion 
mode and a nitrogen laser was used. Normally, each spec- 
trum was averaged for more than 100 shots and a standard 
55 25-point smoothing was applied. 

Results and Discussions 
Conventional Solid-state Sequencing 

In conventional sequencing methods, a primer is direcdy 

60 annealed to the template and then extended and terminated 
in a Sanger dideoxy sequencing. Normally, a biotinylated 
primer is used and the sequencing ladders are captured by 
streptavidin-coated magnetic beads. After washing, the 
products are eluted from the beads using EDTA and forma- 

65 mide. However, our previous findings indicated that only the 
annealed strand of a duplex is desorbed and the immobilized 
strand remains on the beads (Tang, K. et al., (1995) Nucleic 
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Acids Research 23:3126-3131). Therefore, it is advanta- 
geous to immobilize the template and anneal the primer. 
After the sequencing reaction and washing, the beads with 
the immobilized template and annealed sequencing ladder 
can be loaded directly onto the mass spectrometer target and 
mix with matrix. In MALDI, only the annealed sequencing 
ladder will be desorbed and ionized, and the immobilized 
template will remain on the target. 

A 39-mer template (SEQ. ID. No. 13) was first biotiny- 
lated at the 3' end by adding biotin-14-dATP with terminal 
transferase. More than one biotin-14-dATP molecule could 
be added by the enzyme. However, since the template was 
immobilized and remained on the beads during MALDI, the 
number of biotin-14-dATP would not affect the mass spec- 
tra. A 14-mer primer (SEQ. ID. No. 14) was used for the 
solid-state sequencing. MALDI -TOF mass spectra of the 
four sequencing ladders are shown in FIG. 32, and the 
expected theoretical values are shown in Table III. The 
sequencing reaction produced a relatively homogenous 
ladder, and the full-length sequence was determined easily. 
One peak around 5150 appeared in all reactions are not 
identified. A possible explanation is that a small portion of 
the template formed some kind of secondary structure, such 
as a loop, which hindered sequenase extension. Mis- 
incorporation is of minor importance, since the intensity of 
these peaks were much lower than that of the sequencing 
ladders. Although 7-deaza purines were used in the sequenc- 
ing reaction, which could stabilize the N-glycosidic bond 
and prevent depurination, minor base losses were still 
observed since the primer was not substituted by 
7-deazapurines. The full length ladder, with a ddA at the 3' 
end, appeared in the A reaction with an apparent mass of 
11899.8. However, a more intense peak of 122 appeared in 
all four reactions and is likely due to an addition of an extra 
nucleotide by the Sequenase enzyme. 
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The same technique could be used to sequence longer 
DNA fragments. A 78-mer template containing a CTG 
repeat (SEQ. ID. No. 15) was 3'-biotinylated by adding 
biotin-14-dATP with terminal transferase. An 18-mer primer 

5 (SEQ. ID. No. 16) was annealed right outside the CTG 
repeat so that the repeat could be sequenced immediately 
after primer extension. The four reactions were washed and 
analyzed by MALDI-TOFMS as usual. An example of the 
G-reaction is shown in FIG. 33 and the expected sequencing 

10 ladder is shown in Table IV with theoretical mass values for 
each ladder component. All sequencing peaks were well 
resolved except the last component (theoretical value 
20577.4) was indistinguishable from the background. Two 
neighboring sequencing peaks (a 62-mer and a 63-mer) were 

15 also separated indicating that such sequencing analysis 
could be applicable to longer templates. Again, an addition 
of an extra nucleotide by the Sequenase enzyme was 
observed in this spectrum. This addition is not template 
specific and appeared in all four reactions which makes it 

20 easy to be identified. Compared to the primer peak, the 
sequencing peaks were at much lower intensity in the long 
template case. Further optimization of the sequencing reac- 
tion may be required. 

25 


30 


A-ieaction C-reaction G-reaction T-reaction 


5 '-TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA- (A 13 )^ 3 ' 

3 '-TCAACACTGCATGT-5 ' 

3 '- ATCAACACTGC ATGT- 5 ' 

3 '-CATCAACACTGCATGT-5 1 

3 '-ACATCAACACTGCATGT-5 1 

3 '- AACATCAACACTGCATGT-5 ' 

3 '-TAACATCAACACTGC ATGT- 5 1 

3 '- ATAACATCAAC ACTGCATGT- 5 1 

3 '- GATAACATCAAC ACTGC ATGT- 5 1 

3 '- GG ATAACATCAAC ACTGCATGT- 5 ' 

3 '- CGGATAAC ATCAACACTGC ATGT- 5 ' 

3 '- CCGG ATAACATCAAC ACTGCATGT- 5 1 

3 '-CCCGCATAAC ATCAACACTGC ATGT-5' 

3 '-TCCCGGATAACATCAACACTGCATGT-5' 

3 '- GTCCCCKiATAACATCAAC ACTGCATGT-5 ' 

3 '- CGTCCCGGATAAC ATCAACACTGC AIX3T-5 ' 

3 '-ACGTCCCGGATAAC ATCAACACTGC ATGT- 5 ' 

3 '- C ACGTCCCGGATAACATCAAC ACTCKIATGT- 5 ' 

3 '-CCACGTCCCGOATAACATCAACACTGC ATGT-5' 

3 '- ACCACGTCCCGGATAACATC AACACTGC ATGT- 5 ' 

3 '- CtACCACGTCCCGCt ATAACATCAAC ACTGC ATGT- 5 

3 '- CKJACC ACGTCCCCKjATAACATC AACACICtCATGT- 5 1 

3 '- CGGACC ACGTCCCGGATAACATCAACACTGCATGT-5 ' 

3 '- CCGG ACCAC GTCCCGGATAACATCAAC ACTGCATGT- 5 ' 

3 '-ACCGG ACCAC GTCCC GGATAACATCAACACTGCATGT-5 ' 

3 '- GACCGGAC CACGTC CC GGATAACATCAAC ACTGCATGT- 5 1 

3 '-AGACCGGACCACGTCCCGGATAACATCAAC ACTGCATGT- 5' 


4223.8 
4521.0 


5123.4 
5436.6 


7001.6 
7290.8 
7580.0 


9105.0 
9394.2 


10655.0 
10944.2 


6383.2 
6712.4 


10036.6 
10365.8 
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Sequencing using Duplex DNA Probes for Capturing and 
Priming 

Duplex DNA probes with single -stranded overhang have 
been demonstrated to be able to capture specific DNA 
templates and also serve as primers for solid-state sequenc- 5 
ing. The scheme is shown in FIG. 34. Stacking interactions 
between a duplex probe and a single-stranded template 
allow only 5 -base overhand to be sufficient for capturing. 
Based on this format, a 5' fluorescent-labeled 23-mer (5'- 
GAT GAT CCG ACG CAT CAC AGC TC) (SEQ. ID. No. ID 
19) was annealed to a 3'-biotinylated 18-mer (5'-GTG ATG 
CGT CGG ATC ATC) (SEQ. ID. No. 20), leaving a 5-base 
overhang. A 15-mer template (5'-TCG GTT CCA AGA 
GCT) (SEQ ID. No. 21) was captured by the duplex and 
sequencing reactions were performed by extension of the 15 
5-base overhang. MALDI-TOF mass spectra of the reactions 
are shown in FIG. 35. All sequencing peaks were resolved 
although at relatively low intensities. The last peak in each 
reaction is due to unspecific addition of one nucleotide to the 
full length extension product by the Sequenase enzyme. For 20 
comparison, the same products were run on a conventional 
DNA sequencer and a stacking fluorogram of the results is 
shown in FIGS. 36A-36D. As can be seen from the Figure, 
the mass spectra had the same pattern as the fluorogram with 
sequencing peaks at much lower intensity compared to the 25 
23-mer primer. 

Improvements of MALDI-TOF Mass Spectrometry as a 
Detection Technique 

Sample distribution can be made more homogenous and 
signal intensity could potentially be increased by imple- 30 
menting the picoliter vial technique. In practice, the samples 
can be loaded on small pits with square openings of 100 um 
size. The beads used in the solid-state sequencing is less than 
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10 um in diameter, so they should fit well in the microliter 
vials. Microcrystals of matrix and DNA containing "sweet 
spots" will be confined in the vial. Since the laser spot size 
is about 100 jum in diameter, it will cover the entire opening 
of the vial. Therefore, searching for sweet spots will be 
unnecessary and high repetition-rate laser (e.g.>10 Hz) can 
be used for acquiring spectra. An earlier report has shown 
that this device is capable of increasing the detection sen- 
sitivity of peptides and proteins by several orders of mag- 
nitude compared to conventional MALDI sample prepara- 
tion technique. 

Resolution of MALDI on DNA needs to be further 
improved in order to extend the sequencing range beyond 
100 bases. Currently, using 3-HPA/ammonium citrate as 
matrix and a reflectron TOF mass spectrometer with 5kV ion 
source and 20 kV postacceleration, the resolution of the 
run-through peak in FIG. 33 (73-mer) is greater than 200 
(FWHM) which is enough for sequence determination in 
this case. This resolution is also the highest reported for 
MALDI desorbed DNA ions above the 70-mer range. Use of 
the delayed extraction technique may further enhance reso- 
lution. 

All of the above -cited references and publications are 
hereby incorporated by reference. 

EQUIVALENTS 

Those skilled in the art will recognize, or be able to 
ascertain using no more than routine experimentation, 
numerous equivalents to the specific procedures described 
herein. Such equivalents are considered to be within the 
scope of this invention and are covered by the following 
claims. 


SEQUENCE LISTING 


(1) GENERAL INFORMATION! 


(iii) NUMBER OF SEQUENCES: 21 


(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic: acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: YES 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CATGCCATGG CATG 


(2) INFORMATION FOR SEQ ID NO:2: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: other nucleic acid 


(iii) HYPOTHETICAL: YES 
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-continued 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
AAATTGTGCA CATCCTGCAG C 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE z nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
(iii) HYPOTHETICAL s YES 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
TAACGGTCAT TACGGCCATT GACTGTAGGA CC T GC AT T AC ATGACTAGCT 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: YES 

( ix ) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 17 

(C) OTHER INFORMATION: /note- "N is the mass -modified 

nucleotide 5-[13-hydroxytridecyn- l-yl]-2 ' 
-deoxyuridine" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

TAAAACGACG GGCCAGNG 


(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: YES 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1 

(C) OTHER INFORMATION: /note- "N is the mass-modified 

nucleotide 5-[l3-hydroxytridecyn- l-yl]-2 ' 
-deoxyuridine" 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 17 

(C) OTHER INFORMATION: /note= "N is the mass-modified 

nucleotide 5-[13-hydroxytridecyn- l-yl]-2 ' 
-deoxyuridine" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

NAAAACGACG GGCCAGNG 


(2) INFORMATION FOR SEQ ID NO : 6 : 
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-continued 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GTCACCCTCG ACCTGCAG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: singl 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
TTGTAAAACG ACGGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CTTCCACCGC GATGTTGA 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO :9s 
CAGGAAACAG CTATGAC 


(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
G TAAAACG AC GGCCAGT 


(2) INFORMATION FOR SEQ ID NO: 11: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 


US 6,194,144 Bl 
57 58 


-continued 


(D) TOPOLOGY: linear 
(ii> MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 17 

(C) OTHER INFORMATION: /note= "All lowercase letters 

represent RiboG" 

(xi) sequence description: seq id NO: 11: 

GTCACCCTCG ACCTGCAgC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 17 

(C) OTHER INFORMATION: /note= "All lowercase letters 

represent RiboG" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
GTTGTAAAAC GAGGGCCAgT 


(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TCTGGCCTGG TGCAGGGCCT ATTGTAGTTG TGACGTACA 


(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TCAACACTGC ATGT 


(2) INFORMATION FOR SEQ ID NO: 15: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 8 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: CDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 15 : 
AAGATCTGAC CAGGGATTCG GTTAGCGTGA CTGCTGCTGC TGCTGCTGCT GCTGGATGAT 6 0 


US 6,194,144 Bl 
59 60 


-continued 


CCGACGCATC AGATCTGG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CTACTAGGCT GCGTAGTC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GATGATCCGA C GC AT C AC AG CTC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTACTAGGCT GCGTAGTGTC GAGAACCTTG GCT 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GATGATCCGA C GC AT C AC AG CTC 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GTGATGCGTC GGATCATC 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 
TCGGTTCCAAGAGCT 


What is claimed is: 

1. A method for determining the sequence of a nucleic 
acid, comprising the steps of: 

a) generating at least two base -specifically terminated 2Q 
nucleic acid fragments containing deazapurine nucle- 
otides; 

b) determining the molecular weight value of each base- 
specifically terminated fragment by mass spectrometry, 
wherein the molecular weight values of at least two 25 
base-specifically terminated fragments are determined 
concurrently; and 

c) determining the sequence of the nucleic acid by align- 
ing the base-specifically terminated nucleic acid frag- 
ments according to molecular weight. 30 

2. The method according to claim 1 wherein the nucleic 
acid fragments are purified before the step of determining 
the molecular weight values by mass spectrometry. 

3. The method according to claim 2 wherein the nucleic 
acid fragments are purified, comprising the steps of: 35 

a) reversibly immobilizing the nucleic acid fragments on 
a solid support; and 

b) washing out all remaining reactants and by-products. 

4. The method according to claim 3, further comprising 
the step of removing the nucleic acid fragments from the 40 
solid support. 

5. The method of claim 1, wherein the deazapurine 
moieties are selected from the group consisting of: 
C 7 -deazaguanine, C 7 -deazaadenine, C 7 -deazainosine 
triphosphate, C 9 -deazaadenine, C 9 -deazaguanine and 45 
C 9 -deazainosine triphosphate. 

6. The method of claim 1, wherein the deazapurine 
nucleotides comprise at least about 80% of the nucleotide 
fragment. 

7. A process of claim 1 wherein the mass spectrometer is 50 
selected from the group consisting of: Matrix- Assisted Laser 
Desorption/Ionization Time-of -Flight (MALDI-TOF), Elec- 
trospray (ES), Ion Cyclotron Resonance (ICR), and Fourier 
Transform and combinations thereof. 

8. The method according to claim 1, wherein more than 55 
one species of nucleic acid are concurrently sequenced by 
multiplex mass spectrometric nucleic acid sequencing 
employing nucleic acid primers, chain-elongating 
nucleotides, and chain -terminating nucleotides, wherein one 

of the sets of base-specifically terminated fragments is 60 
unmodified and the other sets of base -specifically terminated 
nucleic acid fragments are mass modified, and each of the 
sets of base-specifically terminated nucleic acid fragments 
has a sufficient mass difference to be distinguished from the 
others by mass spectrometry. 65 

9. The method according to claim 8, wherein at least one 
of the sets of mass-modified base -specifically terminated 


15 


fragments is modified with a mass-modifying functionality 
at a heterocyclic base of at least one nucleotide. 

10. The method according to claim 9, wherein the het- 
erocyclic base-modified nucleotide is selected from the 
group consisting of a cytosine nucleotide modified at C-5, a 
thymine nucleotide modified at C-5, a thymine nucleotide 
modified at the C-5 methyl group, a uracil nucleotide 
modified at C-5, an adenine nucleotide modified at C-8, an 
adenine nucleotide modified at C-7, a c 7 -deazaadenine 
modified at C-8, a c 7 -deazaadenine modified at C-7, a 
guanine nucleotide modified at C-8, a guanine nucleotide 
modified at C-7, a c 7 -deazaguanine modified at C-8, a 
c 7 -deazaguanine modified at C-7, a hypoxanthine modified 
at C-8, a c 7 -deazahypoxanthine modified at C-7, and a 
c 7 -deazahypoxanthine modified at C-8. 

11. The method according to claim 8, wherein at least one 
of the sets of mass-modified base -specifically terminated 
nucleic acid fragments is modified with a mass-modifying 
functionality attached to one or more phosphate moieties of 
the internucleotidic linkages of the fragments. 

12. The method according to claim 8, wherein at least one 
of the sets of mass-modified base -specifically terminated 
nucleic acid fragments is modified with a mass-modifying 
functionality attached to one or more sugar moieties of 
nucleotides within the set of mass modified base-specifically 
terminated fragments at at least one sugar position selected 
from the group consisting of a C-2' position, an external C-3' 
position, and an external C-5' position. 

13. The method according to claim 8, wherein at least one 
of the sets of mass-modified base -specifically terminated 
nucleic acid fragments is modified with a mass-modifying 
functionality (M) attached to the sugar moiety of a 
5 '-terminal nucleotide and wherein the mass-modifying 
function (M) is the linking functionality (L). 

14. The method according to claim 8, wherein a mass- 
modifying functionality (M) is attached to a set of base- 
specifically terminated nucleic acid fragments subsequent to 
generating the base-specifically terminated nucleic acid 
fragments and prior to determining the molecular weight 
values for the nested fragments by mass spectrometry. 

15. The method according to claim 14, wherein the 
base-specifically terminated nucleic acid fragments are gen- 
erated using at least one reagent selected from the group 
consisting of a nucleic acid primer, a chain-elongating 
nucleotide, a chain-terminating nucleotide and a tag probe 
which has been modified with a precursor of the mass- 
modifying functionality, M; and a subsequent step com- 
prises modifying the precursor of the mass-modifying func- 
tionality to generate the mass-modifying functionality, M, 
prior to mass spectrometric analysis. 

16. A method of sequencing a target nucleic acid, com- 
prising the steps of: 


US 6,1! 

63 

a) reversibly linking oligonucleotide primers to a solid 
support; 

b) hybridizing to the primers, at least a portion of the 
target nucleic acid and generating via chain elongation 
of the primer and subsequent termination, at least two 
base -specifically terminated nucleic acid fragments 
containing deazapurine nucleotides; 

c) determining the molecular weight value of each based- 
specifically terminated fragment by mass spectrometry, 
wherein: 

the molecular weight values of at least two base- 
specifically terminated fragments are determined 
concurrently; and 

the nested fragments are cleaved from the solid support 
during mass spectrometry; and 

d) determining the nucleotide sequence by aligning the 
base specifically terminated fragments according to 
molecular weight. 

17. The method according to claim 16 wherein the nucleic 
acid fragments are purified before the step of determining 
the molecular weight values by mass spectrometry. 

18. The method according to claim 17 wherein the nucleic 
acid fragments are purified, comprising the steps of: 

a) reversibly immobilizing the nucleic acid fragments on 
a solid support; and 

b) washing out all remaining reactants and by-products. 

19. The method according to claim 18, further comprising 
the step of removing the nucleic acid fragments from the 
solid support. 

20. The method of claim 16, wherein the deazapurine 
moieties are selected from the group consisting of: 
C 7 -deazaguanine, C 7 -deazaadenine, C 7 -deazainosine 
triphosphate, C 9 -deazaadenine, C 9 -deazaguanine and 
C 9 -deazainosine triphosphate. 

21. The method of claim 16, wherein deazapurine nucle- 
otides comprise at least about 80% of the nucleotide frag- 
ment. 

22. A process of claim 16 wherein the mass spectrometer 
is selected from the group consisting of: Matrix -Assisted 
Laser Desorption/Ionization Time-of-Flight (MALDI-TOF), 
Electrospray (ES), Ion Cyclotron Resonance (ICR), and 
Fourier Transform and combinations thereof. 

23. The method according to claim 16, wherein more than 
one species of nucleic acid are concurrendy sequenced by 
multiplex mass spectrometric nucleic acid sequencing 
employing nucleic acid primers, chain-elongating 
nucleotides, and chain-terminating nucleotides, wherein one 
of the sets of base-specifically terminated fragments is 
unmodified and the other sets of base -specifically terminated 
nucleic acid fragments are mass modified, and each of the 
sets of base-specifically terminated nucleic acid fragments 
has a sufficient mass difference to be distinguished from the 
others by mass spectrometry. 

24. The method according to claim 23, wherein at least 
one of the sets of mass-modified base-specifically termi- 
nated fragments is modified with a mass-modifying func- 
tionality (M) at a heterocyclic base of at least one nucleotide. 

25. The method according to claim 24, wherein the 
heterocyclic base-modified nucleotide is selected from the 
group consisting of a cytosine nucleotide modified at C-5, a 
thymine nucleotide modified at C-5, a thymine nucleotide 
modified at the C-5 methyl group, a uracil nucleotide 
modified at C-5, an adenine nucleotide modified at C-8, an 
adenine nucleotide modified at C-7, a c 7 -deazaadenine 
modified at C-8, a c 7 -deazaadenine modified at C-7, a 
guanine nucleotide modified at C-8, a guanine nucleotide 
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modified at C-7, a c 7 -deazaguanine modified at C-8, a 
c 7 -deazaguanine modified at C-7, a hypoxanthine modified 
at C-8, a c 7 -deazahypoxanthine modified at C-7, and a 
c 7 -deazahypoxanthine modified at C-8. 

5 26. The method according to claim 23, wherein at least 
one of the sets of mass-modified base-specifically termi- 
nated nucleic acid fragments is modified with a mass- 
modifying functionality (M) attached to one or more phos- 
phate moieties of the internucleotidic linkages of the 

1Q fragments. 

27. The method according to claim 23, wherein at least 
one of the sets of mass-modified base-specifically termi- 
nated nucleic acid fragments is modified with a mass- 
modifying functionality (M) attached to one or more sugar 
15 moieties of nucleotides within the set of mass modified 
base-specifically terminated fragments at at least one sugar 
position selected from the group consisting of a C-2' 
position, an external C-3' position, and an external C-5' 
position. 

20 28. The method according to claim 23, wherein at least 
one of the sets of mass-modified base-specifically termi- 
nated nucleic acid fragments is modified with a mass- 
modifying functionality (M) attached to the sugar moiety of 
a 5 '-terminal nucleotide and wherein the mass-modifying 

25 function (M) is the linking functionality (L). 

29. The method according to claim 23, wherein a mass- 
modifying functionality (M) is attached to a set of base- 
specifically terminated nucleic acid fragments subsequent to 
generating the base-specifically terminated nucleic acid 

30 fragments and prior to determining the molecular weight 
values for the nested fragments by mass spectrometry. 

30. The method according to claim 29, wherein the 
base-specifically terminated nucleic acid fragments are gen- 
erated using at least one reagent selected from the group 

35 consisting of a nucleic acid primer, a chain-elongating 
nucleotide, a chain-terminating nucleotide and a tag probe 
which has been modified with a precursor of the mass- 
modifying functionality, M; and a subsequent step com- 
prises modifying the precursor of the mass-modifying 

40 functionality, M, to generate the mass-modifying 
functionality, M, prior to mass spectrometric analysis. 

31. A method of multiplex analysis of nucleic acid 
sequences, comprising the steps of: 

a) reversibly linking a nucleic acid primer to a solid 
45 support; 

b) hybridizing to the primers, at least a portion of the 
target nucleic acid and generating via chain elongation 
of the primer with chain-elongating nucleoside triph- 
osphates and subsequent termination with chain- 

50 terminating nucleoside triphosphates, at least two, 
base-specifically terminated nucleic acid fragments 
containing deazapurine nucleotides; 

c) determining the molecular weight value of each frag- 
ment by matrix assisted laser desorption/ionization 

55 mass spectrometry wherein the molecular weight val- 
ues of at least two base -specifically terminated frag- 
ments are determined concurrently and wherein the 
fragments are cleaved from the solid support during 
mass spectrometry; and 

60 d) determining the nucleotide sequence of the nucleic acid 
by aligning the fragments according to molecular 
weight. 

32. The method according to claim 31, wherein the 
reversible linkage is a photocleavable bond. 

65 33. The method according to claim 31 wherein the base- 
specifically terminated fragments are cleaved from the solid 
support prior to mass spectrometry. 
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34. The method of claim 31, wherein more than one 
species of nucleic acid are concurrently sequenced by mul- 
tiplex mass spectrometric nucleic acid sequencing employ- 
ing nucleic acid primers, chain-elongating nucleotides, and 
chain-terminating nucleotides, wherein one of the sets of 5 
base -specifically terminated fragments is unmodified and 
the other sets of base-specifically terminated nucleic acid 
fragments are mass modified, and each of the sets of 
base -specifically terminated nucleic acid fragments has a 
sufficient mass difference to be distinguished from the others 1Q 
by mass spectrometry. 

35. The method according to claim 34, wherein at least 
one of the sets of mass-modified base-specifically termi- 
nated fragments is modified with a mass-modifying func- 
tionality at a heterocyclic base of at least one nucleotide. 

36. The method according to claim 35, wherein the 15 
heterocyclic base-modified nucleotide is selected from the 
group consisting of a cytosine nucleotide modified at C-5, a 
thymine nucleotide modified at C-5, a thymine nucleotide 
modified at the C-5 methyl group, a uracil nucleotide 
modified at C-5, an adenine nucleotide modified at C-8, an 20 
adenine nucleotide modified at C-7, a c 7 -deazaadenine 
modified at C-8, a c 7 -deazaadenine modified at C-7, a 
guanine nucleotide modified at C-8, a guanine nucleotide 
modified at C-7, a c 7 -deazaguanine modified at C-8, a 
c 7 -deazaguanine modified at C-7, a hypoxanthine modified 25 
at C-8, a c 7 -deazahypoxanthine modified at C-7, and a 
c 7 -deazahypoxanthine modified at C-8. 

37. The method according to claim 34, wherein at least 
one of the sets of mass-modified base-specifically termi- 
nated nucleic acid fragments is modified with a mass- 30 
modifying functionality attached to one or more phosphate 
moieties of the internucleotidic linkages of the fragments. 

38. The method according to claim 34, wherein at least 
one of the sets of mass-modified base-specifically termi- 
nated nucleic acid fragments is modified with a mass- 35 
modifying functionality attached to one or more sugar 
moieties of nucleotides within the set of mass modified 
base -specifically terminated fragments at at least one sugar 
position selected from the group consisting of a C-2' 
position, an external C-3' position, and an external C-5' 40 
position. 

39. The method according to claim 34, wherein at least 
one of the sets of mass-modified base-specifically termi- 
nated nucleic acid fragments is modified with a mass- 
modifying functionality (M) attached to the sugar moiety of 45 
a 5'-terminal nucleotide and wherein the mass-modifying 
function (M) is the linking functionality (L). 

40. The method according to claim 34, wherein a mass- 
modifying functionality (M) is attached to a set of base- 
specifically terminated nucleic acid fragments subsequent to 50 
generating the base-specifically terminated nucleic acid 
fragments and prior to determining the molecular weight 
values for the nested fragments by mass spectrometry. 

41. The method according to claim 40, wherein the 
base -specifically terminated nucleic acid fragments are gen- 55 
erated using at least one reagent selected from the group 
consisting of a nucleic acid primer, a chain-elongating 
nucleotide, a chain-terminating nucleotide and a tag probe 
which has been modified with a precursor of the mass- 
modifying functionality, M; and a subsequent step com- 60 
prises modifying the precursor of the mass-modifying func- 
tionality to generate the mass-modifying functionality, M, 
prior to mass spectrometric analysis. 

42. The method of claim 1, wherein the deazapurine is 
C 7 -deazaguanine. 65 

43. A method for determining the sequence of a nucleic 
acid comprising the steps of: 


a) generating at least two base-specifically terminated 
nucleic acid fragments from a nucleic acid to be 
sequenced, wherein: 

the fragments are generated under conditions which 

comprise cation exchange; and 
the conditions permit sequencing of oligomers that are 

50-mers; 

b) determining the molecular weight value of each of the 
base-specifically terminated fragments by mass 
spectrometry, wherein the molecular weight values of 
at least two base-specifically terminated fragments are 
determined concurrently; and 

c) determining the sequence of the nucleic acid by align- 
ing the base -specifically terminated nucleic acid frag- 
ments according to molecular weight. 

44. A method for determining the sequence of a nucleic 
acid comprising the steps of: 

a) generating at least two base-specifically terminated 
nucleic acid fragments from a nucleic acid to be 
sequenced, wherein: 

the fragments are generated under conditions wherein 
the fragments contain deazapurines; and 

the conditions result in enhanced separation or resolu- 
tion of the fragments when analyzed by mass spec- 
trometry compared to the absence of the conditions; 

b) determining the molecular weight value of each of the 
base-specifically terminated fragments by mass 
spectrometry, wherein the molecular weight values of 
at least two base-specifically terminated fragments are 
determined concurrently; and 

c) determining the sequence of the nucleic acid by align- 
ing the base -specifically terminated nucleic acid frag- 
ments according to molecular weight. 

45. The method of claim 44, where the conditions permit 
sequencing of oligomers that are 50-mer. 

46. A method for determining the sequence of a nucleic 
acid, comprising the steps of: 

a) generating at least two base-specifically terminated 
nucleic acid fragments wherein: 

at least one of the nucleic acid fragments comprises two 
different mass modifications 

b) determining the molecular weight value of each base- 
specifically terminated fragment by mass spectrometry; 
and 

c) determining the sequence of the nucleic acid by align- 
ing the one or more sets of base-specifically terminated 
nucleic acid fragments according to molecular weight. 

47. The method of claim 44, wherein at least 80% of the 
nucleotides of the nucleic acid fragments are deazapurine 
nucleotides. 

48. The method of claim 43, wherein mass spectrometry 
is selected from the group consisting of: Matrix-Assisted 
Laser Desorption/lonization Time-of-Flight (MALDI-TOF), 
Electrospray (ES), Ion Cyclotron Resonance (ICR), and 
Fourier Transform and combinations thereof. 

49. The method of claim 44, wherein mass spectrometry 
is selected from the group consisting of: Matrix -Assisted 
Laser Desorption/lonization Time-of-Flight (MALDI-TOF), 
Electrospray (ES), Ion Cyclotron Resonance (ICR), and 
Fourier Transform and combinations thereof. 

50. The method of claim 43, wherein: 

a plurality of nucleic acid fragments are concurrently 
sequenced; 

one of the sets of base-specifically terminated fragments 
is unmodified and the other sets of base-specifically 
terminated nucleic acid fragments are mass modified; 
and 
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each of the sets of base -specifically terminated nucleic 
acid fragments has a sufficient mass difference from the 
others to be distinguished from the others by mass 
spectrometry. 

51. The method of claim 5, wherein the deazapurine is 
selected from the group consisting of C°-deazaadenine, 
C 9 -deazaguanine and C 9 -deazainosine. 

52. The method of claim 5, wherein the deazapurine is 
selected from the group consisting of C 7 -deazaguanine, 
C 7 -deazaadenine and C 7 -deazainosine. 

53. The method of claim 20, wherein the deazapurine is 
selected from the group consisting of C 9 -deazaadenine, 
C 9 -deazaguanine and C 9 -deazainosine. 

54. The method of claim 20, wherein the deazapurine is 
selected from the group consisting of C 7 -deazaguanine, 
C 7 -deazaadenine and C 7 -deazainosine. 

55. The method of claim 31, wherein the deazapurine is 
selected from the group consisting of C 9 -deazaadenine, 
C 9 -deazaguanine and C 9 -deazainosine. 

56. The method of claim 31, wherein the deazapurine is 
selected from the group consisting of C 7 -deazaguanine, 
C 7 -deazaadenine and C 7 -deazainosine. 

57. The method of claim 31, wherein: 

one or more of the nucleotides in the primer, chain- 
terminating nucleoside triphosphates and/or chain- 
elongating nucleoside triphosphates is mass modified. 
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58. The method of claim 43, wherein four sets of base- 
specifically terminated nucleic acid fragments from a 
nucleic acid to be sequenced are generated. 
s 59. The method of claim 44, wherein four sets of base- 
specifically terminated nucleic acid fragments from a 
nucleic acid to be sequenced are generated. 

60. The method of claim 1, wherein four sets of base- 
specifically terminated nucleic acid fragments from a 

10 nucleic acid to be sequenced are generated. 

61. The method of claim 1, wherein the base-specifically 
terminated fragments are linked to a solid support. 

62. The method of claim 1, wherein prior to determining 
15 the molecular weight of the base-specifically terminated 

fragments by mass spectrometry the fragments are exposed 
to a single laser source. 

63. The method of claim 1, wherein the deazapurine is 

„„ C 7 -deazainosine. 

20 

64. The method of claim 20, wherein the deazapurine is 
C 7 -deazaguanine. 

65. The method of claim 20, wherein the deazapurine is 
C 7 -deazainosine. 

25 
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